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Preface to the Second Edition 



I have taken the occasion of a second edition for enhancements and refine- 
ments that preserve the spirit of the original. I have more fully portrayed 
the remarkable relationships between symmetry group generators and their 
Lie bracket and the dynamical invariants of those symmetries and their 
Poisson bracket in Chapter 3. 

The radical departure of quantum waves from classical waves is more 
sharply drawn in Chapter 5. The spectacular way in which the states of 
the hydrogen atom unfold from its symmetries is the centerpiece of quan- 
tum Kepler motion. I now lead readers more deliberately through this 
unfolding by showing how the master quantum number orchestrating four- 
dimensional rotational symmetry orchestrates the hydrogen atom states. 

I offer readers in Chapter 7 a more descriptive portrait of the break-up 
of regular Kepler motion on tori into periodic resonant points when the 
motion becomes chaotic. And I now include a sketch of KAM Theory — 
the greatest achievement in celestial mechanics of the twentieth century — in 
properly filling out the story of chaotic Kepler motion. Lastly, a new edition 
has given me the opportunity to correct errors. 

The Fields of Michael and Gabriel David Oliver 

Meadows of Dan, Virginia, USA 
Autumn 2002 




Preface to the First Edition 



The universe embraces with sensual presence. No thought, no word is 
needed to behold sky, sun, moon, stars. It is not in reflection but in experi- 
ence that one inhales the earthy smells of damp woods, becomes dizzy under 
the sweep of the starry night sky, basks in the sun’s warmth, rolls in ocean 
breakers, tastes salt. The sensual universe is open, available, immediate. 

But beneath its sensuality, the universe has an interior landscape that lies 
hidden. Though veiled, the inner nature of the cosmos draws us. It invites 
us to find the underlying source of the motion of sun, moon, and stars, 
search out the primal form of all matter, and discover the unity beneath 
the diversity of the earthy substances that excite the senses. The inner 
nature of the cosmos is a landscape of exquisite beauty which appropriately 
underlies the exterior splendor of our experience. This interior landscape of 
matter and motion which lies both in and beyond the senses — the physical 
face of the universe — is mathematical. 

That the world we see, smell, and touch is deeply mathematical is a 
fact of unceasing amazement to me. The natural world presents us mathe- 
matical symmetries of compelling beauty; and a rich vein of mathematics, 
inspired by these raw materials from the natural world, is potently phys- 
ical. The physical and mathematical resonate, creatively nourishing one 
another. Although the sensual face of the world presents itself to us quite 
openly, its physical face, hidden as it is in mathematics, must be gained by 
following a path of reflection and study. 

The physical world opens to us through mechanics , the science of motion. 
Mechanics is the wellspring from which physics flows, the source out of 
which a multitude of applications and elaborations from astrophysics to 
elementary particles emerges. Insight into the interior nature of the physical 
world begins with mechanics, with a vision of the motion that underlies all 
the objects of the world and the subject of this book. 

For whom is this book intended? It is intended for lovers. As a young 
man, I was drawn by love of the physical world and the things one could 
make from it through knowledge of its interior, mechanical behavior. I 
followed my fascination into the study of physics and engineering. I found 
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myself not only conceiving and developing nuclear and electric rockets and 
sources for power in space with this knowledge but also sharing that vision 
by teaching other young men and women like myself. 

Then, slowly, as flickering symptoms scarcely cross across one’s con- 
sciousness but steadily rise up in the body to reveal a raging disease, the 
things I was doing with the physical world rose up to confront me. I had 
begun with delight in scientific discovery and engineering craft. I ended 
up developing sources of massive energy and power for a new generation 
of space weaponry — well before the movie and the reality of Star Wars. I 
had been drawn by the beauty of the starry night sky. I ended up in the 
first ambitious attempt of the human race to fill that sky with devasta- 
tion. My innocence gave way to the realization that mechanics is a science 
both wonderful and terrible. Mechanics is the vehicle of all physical theory. 
Mechanics is the vehicle of war. The two have been inseparable. 

Mechanics originated in the earliest myths of human origins. Cain was a 
man born into matter, a worker of the soil given to wonder about force and 
motion. He took a piece of the natural world and mechanically fashioned 
it into a plowshare. Then he took that plowshare, split Abel’s head, and 
simultaneously became the inventor of the sword. Physics, a reveler in du- 
alities, has never escaped its primal duality in the sword and the plowshare. 

In the time of the Greeks our science received its name: mechanics 
(larixoi^v) — a device for delusion in warfare. The practitioners of Cain’s 
art had moved on in mechanical prowess. They beat their plowshares into 
the armed bowels of mechanical horses that could simultaneously split a 
multitude of heads. Mechanics, originating as delight in force and motion, 
suffered to become the handmaid of humanly wrought power and delusion 
in warfare. 

Seventeenth century Venice, an early superpower, showed the world how 
to organize mechanical wondering and tinkering under the minions of the 
arsenal; and the arsenal proliferated beyond renaissance imagination to 
become the superarsenals of the modern world. The arsenal claimed the 
most distinguished physicists and engineers of the mid-twentieth century 
in our own recapitulation of the plowshare and the sword. These men and 
women hammered Albert Einstein’s beautiful plowshare of nature’s fields 
of motion, £ = me 2 , into the most hideous transformation of the sword yet 
to appear. 

Is it possible to have a celebration of mechanics with that dreadful her- 
itage? Let me answer with a story from the mountains in which I now 
live. A notorious local man was known as “the awfulest man.” Not only 
was he scheming and exploitative; he could be dangerous. When he died, 
folks came to the funeral more to hear what the preacher would have to say 
about the awfulest man than to mourn. The preacher began with silence. 
Then, looking squarely at the congregation, he broke into a dreamy smile 
and said, “Didn’t you just love to hear that man whistle!” 

As surely as a wondrous side of the awfulest man could be recognized 
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by the melliflous trilling that announced his presence, mechanics too can 
be recognized as a wondrous, if blighted science. Mechanics has a voice in 
praise of the creation. Mechanics can dance — other than to the tune of the 
piper. After fleeing the work of the arsenal, I remembered and longed for 
the ethereal whistle of mechanics; and I began to write. 

Looking to the redemption of the dreaded science, I turn from humanly 
fabricated delusion to naturally created wonder. Look to mechanics as 
the trance of nature rather than the delusion of invincible safety. Think of 
mechanics as a gift of nature rather than the purveyor of devices to threaten 
and dominate. So set aside the monstrosities into which the plowshare has 
been beaten and gather round. Let me tell the story of the inner nature of 
the world from the scale of the solar system down to the scale of the atom. 
My story speaks of both the physical world and human imagination, of the 
heavens and the elements and the individual human beings past and living 
entranced with the nature of the universe. 

Telling about physics is a part of story-telling appropriate to the modern 
world. It is in the story that the collected knowledge of a culture is gathered, 
celebrated, and passed on — and physics is no exception. It is my hope that 
this story of the physical world may touch and delight others, particularly 
those just coming of age in physics and mathematics and caught up in 
its fascination. It is to you that I especially offer this book as a spirited 
companion; for I wrote the book I would have loved to have placed in my 
hands as a young student. 

My tale of mechanics is not an alternative to rigorous study and the 
exercise of problem-solving required to master physical principles. Nor is it 
intended as a diversion from masterful texts such as Arnold Sommerfeld’s 
Mechanics, the ebullient works of V. I. Arnol’d in mechanics, or the Me- 
chanics and the Quantum Mechanics of L. D. Landau and E. M. Lifshitz. 
I offer my telling of this traditional story as a soulful companion to the 
textbooks. 

Experienced old-timers know thes stories intimately. I would be de- 
lighted if they find a thing or two to catch their fancy in my rendering 
of the tale. (Stories, especially the most traditional, are never retold in 
precisely the same way.) 

It is my hope that those whose main interests lie outside physics may 
also be drawn into this story of the mathematical splendor of the world; 
and I welcome you. Mathematics is a central mode of expression in this 
kind of story-telling. Mathematics is part of our language, an important 
means of perception and discernment. I suggest that mathematical physics 
is not only a language but a visual art whose structure, symmetries, and 
iconography reflect the symmetry and beauty of the world that it describes. 

Basic undergraduate mathematics for science, engineering, and mathe- 
matics students provides a sufficient level with which one may enter into 
the story. For those readers with deeper interest and greater mathematical 
preparation I offer a set of notes with more extensive (and in some cases 
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more advanced) mathematical development. These notes constitute the 
second half of the book so they do not interrupt the flow of the story. The 
notes are distinguished from the main text by boldface titles and equation 
numbers such as Note 1, Eq. (1.1). 

This story would not be worth telling if it were not a story of mystery. 
The sword of mechanics proclaims the profane. But the plowshare of me- 
chanics parts the earth revealing the sacred in matter. Each time the story 
is retold, glimpses of that mystery become possible. I pray that will happen 
here. 

The Fields of Michael and Gabriel David Oliver 

Meadows of Dan, Virginia, USA 
Autumn 1993 
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Chapter 1 



The Shaggy Steed of Physics 



In the story of Prince Conn* a peaceful and prosperous kingdom has fallen 
into chaos. A stepmother’s intrigue has cast a deadly spell over the youthful 
monarch and his realm. Helpless to find the way to save his life and restore 
harmony to the kingdom, the prince is directed by a druid to a small shaggy 
horse. The shaggy steed offers to conduct the prince upon a quest through 
harrowing trials of fire and flood and faithfully bring him to a magical realm 
where he will procure powers with which he can break the spell and restore 
the harmony of his land. 

The shaggy steed proves to be an extraordinary mentor to the prince. 
They embark upon a fabulous journey in which he brings the young man to 
an understanding of the forces hidden in the universe and in himself. Under 
the tutelage of this creature from the natural world the prince is prepared 
to enter the magical kingdom and then return to rule his own realm. 

But at the threshold of success, the horse, whom the prince now loves 
dearly, will go no further with him. Instead, he bids the prince take a knife 
and a vial of ointment from a hiding place within his ear. Then he directs 
the youth to slay him, flay his hide, and wrap himself in it. It is through 
his sacrifice that the prince will be able to make his way unscathed into the 
magical kingdom. 

“Never!” proclaims the prince in horror. 

But the prince’s mentor is implacable. “If you fail in this final task,” 
he implores, “you will perish and I shall endure a fate worse than death. I 
only ask that after you achieve your goal you return to my body, put away 
the vultures, and cover it with the ointment.” 

Reluctantly, the prince takes up the knife and uncertainly points it at 
his teacher. Instantly, as if impelled by powers beyond the young man’s 
hand, the knife plunges into the throat of the horse and the dreaded act is 
done. 

With broken heart, the prince makes his way through the magical king- 
dom shrouded in the hide of his teacher, procures the magical artifacts, 

* “The Story of Conn-eda” in W. B. Yeats (ed.), Irish Fairy and Folk Tales , New York: 
Dorset Press, 1986. 
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and prepares to return to his own kingdom. But first he searches out the 
carcass of his slain guide. Driving off the vultures, he annoints the remains. 
Thereupon the dead flesh undergoes a magical transformation, achieving 
the form of a noble prince like himself. In a joyous reunion the prince 
recognizes both his former teacher and a new, unexpected companion. 

I am reminded of this tale as we set out to learn of the motion, symmetry, 
and beauty of nature; for our own experience of the physical world is similar 
to that of Prince Conn. Like the world of the prince, the world of physics can 
be plunged into chaos. Familiar understanding can crumble in contradiction 
as happened when deepening inconsistencies and new observations with 
the telescope undermined the long tradition of the Ptolemaic theory of the 
universe.* 

In the midst of the confusion and uncertainty of such a breakdown, a new 
idea appears. At first it seems unpromising, even false as did Copernicus’s 
vision of a heliocentric universe to the affirmed practitioners of Ptolemaic 
astronomy. Yet once we grasp its power and beauty, we embrace the new 
theory with conviction. We grow to affirm it as a cornerstone. We even fall 
in love with it. Then, in the face of relentless probing and deepening obser- 
vation of the world, our beloved theory leads us to a crisis of contradiction. 
We must give it up. 

The calculations of Kepler and the telescope of Galileo opened the way to 
the classical mechanics of Isaac Newton. Newton’s laws of motion reigned 
supreme for over two hundred years achieving a brilliant synthesis in the 
principle of least action through the work of Leonhard Euler, Joseph-Louis 
Lagrange, C. G. J. Jacobi, and William Rowan Hamilton. Then the dis- 
covery of the atom thrust Newtonian mechanics into crisis. In the midst 
of this crisis Max Planck and Neils Bohr proposed the bewildering idea 
of the quantum. And quantum mechanics came to supersede Newtonian 
mechanics. 

In our time we acclaim the quantum mechanics of P. A. M. Dirac and 
Richard Feynman. But it is a remarkable fact that the classical mechanical 
heritage of Newton, Kepler, and Galileo and particularly that of Euler, 
Lagrange, Jacobi, and Hamilton is both potently present and wondrously 
transformed in quantum mechanics. The new theory rises from the old. 
Like Prince Conn, we affirm the new knowing we embrace a metamorphosis 
of the old. 

There is a further connection to the shaggy steed appropriate to my 
own telling of the story of the physical world. It is in the smallness and 
homeliness of the mentor-horse. The teacher I have selected is a small and 
homely bit of physics. Our guide will be the two-body problem, a simple but 
mathematically exact description of the motion of two bodies that interact 
through the inverse-square force of gravity and electricity (the force acting 

* Giorgio de Santillana has painted a sweeping and poignant portrait of this breakdown 
in The Crime of Galileo, Chicago and London: U. of Chicago Press, Midway Reprints, 
1976. 
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between the two bodies has a strength that is proportional to the inverse- 
square of the distance of separation between them) . The two-body problem 
is the Shaggy Steed of Physics; and it shall carry us upon a mathematical 
journey in which the unity and beauty of the heavens and the elements will 
be revealed. 

The two-body problem first appears as a little problem. It flows by as 
one among many textbooks shower down upon students. But those willing 
to look more deeply will find a jewel casting its glow over the celestial 
realm of Kepler and Newton to the quantum realm of Bohr and Dirac. 
In its classical form the two-body problem gives us the Kepler ellipse, the 
exemplary motion of the heavens of our solar system. In its quantum- 
mechanical form it yields the hydrogen atom, the prototypical element. In 
these two achievements alone the two-body problem is a moving paradigm 
of the unity and beauty of the physical world. But there is more. 

In its origin in a pristine variational principle and in its beautifully knit 
symmetries the two-body problem heralds the affirmed attributes and cher- 
ished aesthetics of unified theories of all forces and particles. Symmetry 
describes the way objects, when transformed, nonetheless present the same 
identical shape. For example, the symmetries of a sphere are rotations. 
Rotate a sphere from one angular position to another and one has the same 
thing with which one began. Symmetries reflect themselves in invariants: 
quantities that do not change under the action of the symmetry. The an- 
gles specifying the orientation of the sphere change as it is rotated, but 
the radius of the sphere does not. The radius is an invariant of rotational 
symmetry. 

The symmetries of two-body motion with inverse-square forces are most 
remarkable. They are both real and imaginary four- dimensional rota- 
tions. Just as the rotational symmetry of the familiar sphere embedded 
in three-dimensional space is a generalization of the rotational symmetry 
of the sphere in two dimensions (the circle), we shall see that a sphere 
in four-dimensional space is a generalization of the sphere embedded in 
three-dimensional space. All are the same object with an invariant radius, 
but in different dimensions. A sphere with a real radius gives rise to ordi- 
nary trigonometric functions. When the radius is imaginary, the sphere has 
imaginary rotation angles that give rise to hyperbolic rather than trigono- 
metric functions. 

The four-dimensional rotational symmetry of the Kepler problem is in- 
timately connected to the two majestic achievements of twentieth century 
physics: quantum theory and relativity theory. When the rotations are 
real, the radius of the sphere is the Planck constant, the emblem of all 
quantum motions. When the rotations are imaginary, they peal with di- 
vine laughter. The four-dimensional rotational symmetry is then similar to 
that of the relativistic space-time discovered by Albert Einstein. 

The motion of two bodies with inverse-square forces and Einsteinian 
relativity have nothing directly to do with one another. But because of 
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the surprising coincidence of their symmetries we shall find that unique 
manifestation of relativistic symmetry — the spinor -to also be one of the 
natural expressions of two-body celestial motions. (Spinors are quantities 
similar to vectors discovered by P. A. M. Dirac in working out the relativistic 
description of the electron.) Moreover, the rotational symmetry of the two- 
body problem is a paradigm for the unitary symmetries of quarks, the most 
elementary of particles, and pointed the way to them. All these diverse 
aspects of physics are reflected in the motion of two bodies bound by the 
inverse-square force. 

The symmetries of the two-body problem enfold a beautiful set of topo- 
logical relationships that open our eyes to the way in which the topology of 
orbits powerfully summarizes motion — vision that cannot be had by sim- 
ply looking at solutions of differential equations. The four-dimensional ro- 
tational symmetry heralded by the two-body problem with inverse-square 
force is extraordinary in its topology: a four-dimensional rotation (and its 
symmetry sphere in four dimensions) may be decomposed into a pair of 
three-dimensional rotations (and their symmetry spheres in three dimen- 
sions). This decomposition is exceptional; rotations in all other dimensions 
cannot be decomposed into rotations in lower dimensional spaces. Only a 
space of four dimensions has this property. 

Though the world of physics it illuminates is vast, the two-body problem 
has limits. Like Prince Conn’s steed, a faithful and informative guide is 
respectful of its limitations. At the boundary of its realm the motion of 
two-bodies, like the prince’s steed, must be given up. 

One of Isaac Newton’s laws of motion proposed in the Principia Math- 
ematica of 1666 was that the force experienced by one body is equal and 
opposite to that experienced by the other. This is true for gravitational 
forces in the solar system and electrical forces in the atom for which mo- 
tion takes place at a small fraction of the velocity of light. But in the realm 
of velocities near the speed of light electrical and gravitational forces are no 
longer instantaneously equal and opposite. Instead, they propagate from 
body to body at a finite speed, the speed of light. In that revolutionary 
discovery the space and time established by Galileo and Newton is revealed 
as only a limiting form of Einstein’s even more graceful vision of relativistic 
space and time. 

In the regime of motion in which forces propagate from body to body at 
a finite speed, two-body interactions cannot exist. New particles enter the 
picture which carry the propagating force. Motion near the speed of light 
inherently couples infinitely many bodies, a situation radically different 
from the motion of two bodies. 

In the ordinary world of electricity and gravity with velocities small 
compared to the velocity of light, the two-body problem reaches another 
limit at precisely two bodies. Ordinary motion beyond two bodies — even 
that with just three bodies such as two planets and the sun — is not just 
quantitatively different from that of two bodies; it is qualitatively different. 
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There is a singular rift between two-body motion and the motion of many 
bodies. Only two-body motion may be described by simple mathematical 
objects such as the ellipse of Kepler. On the other hand, the motion of more 
than two bodies — the so-called many-body problem — cannot be expressed in 
the most general functions of classical mathematical analysis, infinite series 
of algebraic polynomials. 

The motion of two bodies bound by the inverse-square force possesses 
a crucial property: it preserves the integrity of the initial conditions. Or- 
derly transformation of the initial conditions by the subsequent motion of 
the bodies occurs because the motion of two bodies is insensitive to small 
fluctuations. Neighboring trajectories emanating from closely neighboring 
initial points always remain close to one another. Orderly behavior like this 
is describable by conventional mathematical functions. 

Whereas two-body motion preserves the integrity of the initial conditions 
in the sense described above, many-body motions generally do not. A 
small change in the initial conditions of many-body motion may lead to a 
dramatic divergence in the resulting trajectories. This sensitivity destroys 
the information contained in the initial condition in the subsequent course 
of the motion. A new feature which is impossible with two-body electrical 
and gravitational forces appears which challenges predictability: complex 
turbulent motion better known as chaos. 

Suddenly a surprising and paradoxical freedom springs from the deter- 
ministic laws of mechanics. Motion frees itself from the pattern and struc- 
ture imprinted in the initial conditions by escaping into chaotic behavior. 
Many-body motion is governed by a deterministic set of mathematical equa- 
tions; but these equations yield random rather than deterministic behavior. 
One encounters one of the most remarkable and still puzzling aspects of 
physical law — freedom and determinism coexisting together. 

The vision of the world held by our forebears in antiquity and in the 
medieval era was one in which the heavens and the elements formed a cosmic 
whole. This vision was inspired by order and symmetry; but the order and 
symmetry that the ancients and medievalists envisioned were not rooted 
in penetrating observation of the natural world. Like Prince Conn’s quest, 
our journey begins with disintegration, the disintegration of a medieval 
kingdom of astronomy, philosophy, and politics, a disintegration spurred 
by new observations of the natural world obtained through instruments 
like the telescope. 

We then describe the new unity that began to replace the old, the vision 
of a universal law sustaining all the motions of the universe. The law of 
motion flows from a principle of extraordinary simplicity — the principle of 
least action. We direct this principle to the classical dynamics of two bodies. 
The classical regime is crowned by celestial mechanics. 

Next we consider the extension of mechanics into the quantum world 
of microscopic particles. The quantum regime is crowned by the hydrogen 
atom. We then come to the limits of the two-body problem whereupon we 
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give up our guide before the relativistic nature of gravitational and electrical 
forces on the one hand and the rich structure of many-body motion with its 
freedom within constraint on the other. And in this transformation of the 
two-body problem into the many-body problem we encounter a surprising 
reunion with our familiar guide. 




Chapter 2 



The Heavens and the Elements 



From the earliest times humankind has been drawn to the unity of the 
creation. The ancients understood the large-scale bodies of the heavens 
and the smallest bits of elemental matter as intimately related parts of a 
cosmic whole. They sought to explain that unity with the geometrical and 
religious concepts of their era. 

The medieval world view derived from the Greeks was, for its time, a 
thing of great beauty. The universe was seen as a series of seven concentric 
spheres surrounding the earth. Each was the habitation of one of the moving 
heavenly bodies. Proceeding outward from the earth they were the moon, 
Mercury, Venus, the sun, Mars, Jupiter, and Saturn. Each sphere was the 
seat of one of the seven chemical elements: silver, mercury, copper, gold, 
iron, tin, lead. Each heavenly sphere resonated with a note of the western 
musical scale: re, mi, fa, sol, la, ti, do. Learning in the Middle Ages 
was itself congruent with this cosmic order. The seven disciplines were 
grammar, logic, rhetoric, arithmetic, music, geometry, and astronomy. 

This vision of the universe also provided the medievalists with an expla- 
nation of the unity between spirit and matter. The region beyond the outer 
spheres of the universe was the Empyrean — a realm of pure spirit thought 
to be inhabited by angelic creatures much as men and women inhabit the 
purely material earth at its center. For the popular mind God too was lo- 
calized in space. Beyond the sphere of Saturn, God dwelt upon His throne 
in the Empyrean just as the king dwelt above the nobles, peasants, and 
fields of his realm. 

In the hierarchy from pure spirit in God’s enveloping realm to the mate- 
rial earth at its center the heavenly spheres embodied progressively greater 
proportions of spirit to matter. Human beings came into existence with the 
movement of their souls from God through the spheres down to earth. Upon 
death the soul migrated back out through the cosmos losing its material 
elements appropriately to each sphere until it arrived in God’s realm. 

The medieval insight that the cosmos was unified in beauty and simplic- 
ity was correct. But the medieval understanding of the interplay between 
spirit and matter and the mechanism proposed for the cosmic unity of the 



7 




The Shaggy Steed of Physics 



heavens and the elements were in error. Beautiful theories can sometimes 
end in error. 

Beginning with the Renaissance, the ambition of a unified vision of the 
spiritual and material aspects of the cosmos and human existence gave way 
to the more narrow quest for an understanding of the physical world, a 
narrowing of focus that continues to this day. 



Through a Glass Darkly 

The vision of the universe as a system of spheres reveals the human 
capacity to see geometry in nature. These geometrical structures, even for 
the ancients, were more than stationary artifacts. The heavens were in 
motion. The planets were wandering stars. The geometrical reckoning of 
the heavens went beyond the ordering of the heavenly bodies in a coherent 
geometry. That same geometry could be used to predict their motions. 

The planets were understood to be bodies moving on circular orbits that 
lay upon the celestial spheres. A radius and a speed of rotation (or angular 
velocity) could be deduced for each planet. With these two properties the 
motions were not only describable; they were predictable. 

But the perfect circle did not perfectly predict the motions of heavenly 
bodies. There were serious discrepancies in the predictions afforded by a 
model of planetary orbits moving on concentric circles. The modifications 
that were made to the description of the heavens of spheres and circles 
came to be known as the Ptolemaic system. This system was named af- 
ter Claudius Ptolemaeus, the second century compiler of the Almagest , a 
monograph of the best astronomical formulas of the time. 

The Ptolemaic system embraced the circle as the geometrical form of 
the heavens and accepted the earth as the center of the universe. But 
it introduced many additional circles to bring the predictions into accord 
with the observations. Instead of a planet moving on single circle rotating 
at a single angular velocity about the center of the universe (called the 
deferent), the planet was proposed to move on a second circle (called an 
epicycle) which itself rotated on the deferent. The resulting motion of the 
planet was no longer a perfect circle but a swirling rosette, as shown in 
Fig. 2-1. 

Each planet was burdened with two circles and two angular velocities. 
This cumbersome arrangement proved useful because the epicycle had the 
subversive effect of displacing the true center of rotation of a planet from 
the earth (the assumed center of the universe). The displacement of the 
center of rotation hidden in the mathematics ruined the notion of a “center 
of the universe;” but it vastly improved the predictions of early astronomy. 

The prediction of the orbit shape was a challenge for Ptolemaic astron- 
omy; but the prediction of the speed of a heavenly body over the course 
of its orbit was a more trying challenge. Heavenly bodies were to move 
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Figure 2-1. The Elements of Ptolemaic Astronomy. The earth is the center of the 
universe and the sun is one of the planets. Planets move on epicycles which themselves 
move on deferents centered on the earth. The epicycle moves with the equant which 
rotates uniformly about a center displaced from the center of the earth. 



at a uniform speed of rotation on their circles in correspondence with the 
classical idea of perfection. But uniform speed, like the perfect circle, failed 
to match the observations. 

Ptolemaic astronomers sought to correct this discrepancy by introducing 
the equant. The equant was a ray centered on a point slightly displaced 
from the center of the earth and drawn to the center of the epicycle as 
shown in Fig. 2-1. The equant was to rotate at the uniform speed of the 
heavenly spheres and the epicycle tracked it while moving on its deferent as 
shown in Fig. 2-1. As a result, the planet acquired a nonuniform speed that 
better matched the predictions. The Ptolemaic description of the heavens 
was one in which circles rode upon circles that did not lie at the center of 
the universe. 

Galileo brought measurement and the sharp eye of visual observation, 
enhanced by instruments like the telescope, to this ancient conception of the 
world. So too did Tycho Brahe. Even with the tinkering of epicycles and 
equants, Ptolemaic astronomy could not describe the planetary motions to 
the degree of accuracy with which they could be measured. If the medieval 
and Ptolemaic view of the world’s behavior did not conform to careful 
and rigorous observation, it would be denied its beauty. By Greek and 
medieval standards themselves, that which is not true can be neither good 
nor beautiful. The heavenly spheres, displaced from their center on the 
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earth and cast in Ptolemaic epicycles and equants, had already suffered the 
loss of much of their beauty. 

While Galileo and Brahe devoted themselves to rigorous observation of 
heavenly motions, Nicholai Copernicus and Johannes Kepler were inspired 
by the belief that the cosmos really was beautiful. A cumbersome Ptole- 
maic model was a flawed image of the cosmos even if its orbit predictions 
could be brought into close correspondence with observations. Yielding the 
geometric centrality of the earth as the center of the universe, Copernicus 
temporarily regained the beauty of the ancient world view by establishing 
the sun at the center of the solar system. The Ptolemaic system seemed 
saved by the simple device of placing the sun at the center. But epicycles, 
equants, and deferents, though simplified, could not be fully dispensed with 
(particularly in the motion of a nearby planet like Mars with an eccentric 
orbit); and even with them Ptolemaic predictions still departed from the 
accurate observations of Tycho Brahe. 

Kepler himself had been a superb practitioner of Ptolemaic astronomy. 
He painstakingly constructed a combination of deferents, epicycles, and 
equants that provided the most accurate prediction of the motion of Mars 
yet achieved. Kepler’s Ptolemaic model culminated in a representation of 
the motion of Mars that matched Tycho’s observations with less than 8 
minutes of error in arc. This difference is scarcely noticeable by the naked 
eye. Nonetheless, Kepler took this small discrepancy as certain evidence 
that the Ptolemaic basis of astronomy was fundamentally in error.* 

Kepler then found that a single geometric shape, the ellipse with the sun 
at one of the foci, fit the actual orbits of all the planets almost perfectly. 
(The foci are the generating points of the ellipse displaced from the center 
by a distance that is a measure of its eccentricity.) An ellipse with the sun 
at a focus could account more accurately for the heavenly motions than the 
ponderous Ptolemaic universe with its deferents, epicycles, and equants. 

Kepler wlrent on to show that this new mathematical description had 
even greater powers. In his search for a method of predicting the nonuni- 
form speeds of the planets correctly he found that a planet moved at a 
speed over its orbit for which the time taken to sweep out a fixed amount 
of area is always the same. Visualize the planet moving on its ellipse as 
shown in Fig. 2-2. Imagine wedges of equal amounts of area cut out of 
the elliptical surface contained by the orbit. The time it takes the planet 
to sweep over the elliptical path along the edge of each wedge is always 
the same. As a result, the planet moves at a nonuniform speed, sweeping 
rapidly over the portion of its orbit near the focus at the center and more 
slowly about the distant focus. 



* Kepler wrote, “Divine Providence granted us such a diligent observer in Tycho Brahe 
that his observations convicted this Ptolemaic calculation of an error of S’: it is only 
right that we should accept God’s gift with a grateful mind. . . . Because these S’ could 
not be ignored, they alone have led to a total reformation of astronomy.” [Johannes 
Kepler Gesammelte III , 178, Munich (1937-)] 
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Figure 2-2. Kepler’s Second Law. A planet sweeps out equal areas of its ellipse in equal 
times. The two wedge-shaped shaded areas are equal. The planet takes the same time 
to cover the distance from A to B as that from C to D. As a result, the planet moves 
at a nonuniform speed over its orbit. 



It is a tribute to Kepler’s respect for the primacy of well-verified obser- 
vation against his quest for the perfect geometry of the heavens that he 
stuck with the ellipse even as he felt it lacking. He was drawn to the ellipse 
by its stunning prediction of the periods of the planets. But he found it 
wanting because it did not possess the perfection of the sphere. We now 
know there is a perfection in Keplerian orbits (in their origin in the Laws 
of Motion discovered by Isaac Newton and in their hidden symmetry in the 
inverse-square gravitational force not fully revealed until the modern devel- 
opments in geometry and topology of the twentieth century). The hidden 
perfection of the orbit that elluded Kepler will be revealed in the course of 
this book. 

The explanatory power of the Kepler ellipse seemed inexhaustible. Ke- 
pler also found that the time for a complete orbit of any planet was inversely 
proportional to the | power of the mean distance of the planet from the 
sun. He wrote, “I first believed I was dreaming; but it is absolutely certain 
and exact that the ratio which exists between the periodic times of any two 
planets is precisely the ratio of the § powers of the mean distances.” 

Kepler pointed the way to a deep-seated law that governs all motion. 
The insight of the ancient astronomy that nature was built upon beauty 
and simplicity was correct; but that beauty and simplicity were not rooted 
in spheres and uniform motion. The inner beauty and simplicity were 
resident in an underlying physical law. Here was a new kind of perfection 
that permeated the entire heavens and did not depend upon the specific 
properties of any planet, a beauty more mystical than that of the heavenly 
spheres. Isaac Newton would explicitly exhibit that law: the gravitational 
force binding the bodies of the solar system is inversely proportional to the 
square of the distance of separation between the bodies. 

The Pattern of the Elements 

Copernicus, Kepler, Galileo, and Newton shed great light upon the large- 
scale cosmos. But what of elemental matter itself? What were the ultimate 
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microscopic constituents out of which the stuff of the cosmos was made? 
With the rending of the celestial spheres the musical resonance between 
the heavenly spheres and the elements was silenced. The vision of celestial 
bodies and elemental matter as a cosmic unity fell into oblivion as the in- 
vestigation of elemental matter followed an earth-bound course far removed 
from the motions of heavenly bodies. 

A small number of elements with distinctly varied strength, maleability, 
lustre, melting and boiling points, and combustible properties were found 
to build up all matter. The classical set of elements — silver, mercury, cop- 
per, gold, iron, tin, lead — were all metals. New elements such as carbon, 
bismuth, and sulfur expanded those known by the medievalists. The gases 
hydrogen, oxygen, and nitrogen were well identified by the late eighteenth 
century. In the middle of the nineteenth century well over fifty elements 
were known. 

By this time it had become clear that elements were associated with 
multiples of an indivisible unit weight. The weight of a particular kind 
of element provided a useful way of labeling the elements since chemical 
reactions consumed reactants and produced products with weights in well 
established ratios. The distinguishing feature of an element came to be that 
it was a macroscopic aggregation of a single kind of unit. Matter was not 
continuous and infinitely divisible. It was built up of irreducible elements 
with invariant properties. 

The various elements fell into groups with similar chemical and physical 
properties. Repeating patterns of properties followed increasing multiples 
of atomic weight. An arrangement of atomic weights in octaves ( Newland’s 
Octaves, 1850) neatly grouped elements with similar properties but with 
different atomic weights. By the end of the century, the Russian chemist 
Dimitri Ivanovich Mendeleev had brilliantly constructed a Periodic Table of 
the Elements in which the repeating patterns of the elements were revealed. 

Mendeleev’s table, published in 1872, is shown in Fig. 2-3. It gave birth 
to the modern Periodic Table shown in Fig. 2-4. Mendeleev recognized 
the natural grouping of the elements in eight fundamental groups of the 
first two periods; but the third and fourth period -as well as the higher 
periods — were less clear. This was because after two periods, elements 
group themselves in eighteen groups rather than eight as shown in Fig. 2-4. 
Moreover, the eighth group, which was ultimately recognized as the abode 
of the noble gases, could not be properly filled in 1872 because the nobles 
were inaccessible to the chemical methods of the time. 

Using the pattern in the table, Mendeleev predicted the existence of 
elements which had not yet been discovered. For example, he proposed 
the element he called ekaboron with atomic weight 44 to fill the vacant 
slot in row IV of the third column. The element, a silvery white metal 
with pinkish cast in air, was discovered only a few years later (1879) in 
Scandanavia and has come to be known as Scandium, element number 21 
of Fig. 2-4. Mendeleev also corrected errors in the published weights of 
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Figure 2-3. The Mendeleev Periodic Table of 1872. The chemical elements are not without order. They may be arranged in 
groups with similar chemical and physical properties forming the eight columns of the table. Similar elements have a repeating 
pattern in their atomic weights giving rise to repeating rows ( Liebigs Ann. Chem. Suppl. viii, 1872). 
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Figure 2 ~4- Periodic Table of the Elements. The diversity of the chemical elements 
is imprinted with a fundamental pattern. Each column of the table groups elements 
with similar chemical properties from the extreme chemical reactivity of the alkali 
metals on the far left to the inertness of the noble gases on the far right. When 
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III 



IV 
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ordered by their atomic number (which is roughly one- half the weight), progressively 
higher numbered elements are found to be similar in chemical and physical properties 
to lower numbered elements and may be grouped with them forming rows. The rows 
periodically repeat in the pattern 2 + 8 + 8 + 18 + 18 + -- -. 
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elements because they did not properly match the patterns in the table. 

The Periodic Table provokes obvious questions. For example, some of 
the most reactive elements are those in the first column — the alkali metals 
such as lithium, sodium, potassium, and rubidium. These elements are 
chemically similar. They are soft metals at the pressures and temperatures 
at the earth’s surface. Put any of them in the presence of oxygen and fiery 
results ensue. Put them in water and hydrogen gas is released. Yet their 
atomic weights differ dramatically. Rubidium with an atomic weight of 85 
is an order of magnitude more massive than lithium. How can elements of 
such disparate atomic weight possess nearly identical chemical properties? 

If the elements are ordered by number which is directly correlated with 
atomic weight (the number is roughly one-half the weight), these same 
elements reveal a persistent pattern: H (hydrogen) 1 = 1, Li (lithium) 
3 = 2 + 1, Na (sodium) 11 = 2 + 8 + 1, K (potassium) 19 = 2 + 8 + 8 + 1, 
Rb (rubidium) 37 = 2 + 8 + 8 + 18 + 1. The period is indeed eight in the 
pattern of Newland’s octaves; but only for two repeats: it then jumps to 
repeats of eighteen. 

All these highly active elements have one thing in common: their atomic 
numbers exceed the periods in the pattern 2 + 8 + 8 + 18 + • • • by 1. Re- 
markably, the elements that possess this very pattern are those with prop- 
erties most radically different from the alkali metals: the noble gases on the 
opposite side of the table. The nobles such as He (helium), Ne (neon), A 
(argon), Kr (krypton), and Xe (xenon) are inert — they do not chemically 
combine with other elements; but they are almost identical to the alkalis in 
atomic number. He 2 precedes Li 3 = 2 + 1. Ne 10 = 2 + 8 precedes Na 
11 = 2 + 8 + 1. A 18 = 2 + 8 + 8 precedes K 19 = 2 + 8 + 8 + 1, and so 
on. The small difference of one unit of the alkali metals over the completed 
pattern blocks of the noble gases signals a radical chemical difference from 
inertness to extreme reactivity. 



A New and Deeper Unity 

The Periodic Table was a scheme of heavenly spheres writ small. It 
described patterns in the microscopic world just as the heavenly spheres 
described the patterns of the wandering stars. Periods of 2 , 8 , and 18 were 
deeply etched in the Periodic Table. Could there be be an underlying unifi- 
cation of these numbers in a law of mechanics just as the laws of mechanics 
provided a simple unifying explanation for the orbits and transit times of 
the heavenly bodies? 

The sun, the moon, and the planets exhibit themselves as bodies in 
motion bound by forces. But when one examines a fragment of gold or a 
lump of charcoal, one does not see bodies in motion bound by forces. One 
looks upon a contiguous piece of material. The character of gold and carbon 
at the scale of the human hand does not appear related to motions like those 
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of the planets. The character of materials on this level are manifested in 
chemical and physical properties. Gold is lustrous, readily conducts heat, 
and is heavy. Carbon burns easily with oxygen. 

The description of the chemical and physical properties of gold, carbon, 
or any element in terms of bodies in motion bound by forces — or, as we 
shall now say, particles in motion — does not arise until one considers the 
ultimate divisibility of matter. Take hand-sized lumps of gold or carbon and 
imagine them subdivided into smaller and smaller pieces. What happens 
in the limit of finer and hirer subdivision? A long history of experiment has 
revealed that an irreducible particle that possesses the properties of gold, 
carbon, or any other element is finally encountered. Break that particle 
down to a hirer scale and one no longer has gold, carbon, or the element 
with which one began. This irreducible particle of element is the atom. 
Every element is a macroscopic aggregation of a single kind of atom. 

In progressing from the macroscopic level of the human hand down to the 
microscopic level of the atom, one leaves the contiguous world of lustre, heat 
conduction, and combustion and encounters once again a discrete world of 
particles, motion, and force that images the motions of the heavenly bodies. 
The atom is mostly space just as the solar system is mostly space. It consists 
of a tiny positively charged nucleus and a constellation of distantly orbiting, 
negatively charged electrons bound to the nucleus by an inverse-square force 
much like planets bound to the sun. 

The rhythmic properties and patterns of the Periodic Table are a mani- 
festation of the law of motion with an electrical inverse-square force math- 
ematically identical to that of the gravitational inverse-square force. The 
unity of the heavens and the elements is reborn from the womb of motion 
itself. It is in their law of motion that one glimpses the cosmic unity of the 
heavens and the elements. 

The unity of the cosmos inheres in a law of motion, a law hinted at by the 
pattern and order in matter and in the orbits of heavenly bodies, a law first 
quantitatively rendered by Kepler and Newton. But Newton’s discovery 
is more than law. It is beautiful law. The subsequent work of Euler, 
Lagrange, Hamilton, Jacobi, and others in the eighteenth and nineteenth 
centuries revealed that Newton’s law of motion derives from an even more 
fundamental principle: the principle of least action. The action principle 
weaves motion in a magnificent mathematical tapestry, a work of art of 
sheer simplicity in its central design, yet richly elaborated in detail so hue, 
so astonishingly complex, it cannot be exhausted by even the most powerful 
functions of mathematical analysis. It is a beautiful thing to behold; and 
it is our goal to do so as this story of the physical world unfolds. 

The law of motion embraces both the heavens and the elements; for a 
universal law of motion is law unto the small as well as large. But it would 
not be until this century that the law of motion governing the heavens 
would be extended into the realm of the elements revealing the profound 
connections — and dramatic differences — between the quantum mechanics 
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of the elements and the celestial mechanics of the solar system. 

One says “law” but one does so with a daunting sense of its inadequacy. 
“Law” is both the right word and the wrong word to describe the princi- 
ple underlying the dynamics of nature. The rich texture, structure, and 
complex pattern of the physical world are ultimately traceable to strikingly 
simple mathematical statements about the nature of all motion. It is the 
potent rendering of all motion under a simple mathematical principle that 
resonates with our concept of law. 

But there are contradictions in “law” as the defining category of the 
patterns and order one sees in nature. Law invokes the sense of an in- 
evitable routine. Law dictates determinism. Yet the richness of the cosmos 
belies — dare one say — “mechanical” subservience to a mathematical law 
book. Nature manifests spontaneity and surprise as much as pattern and 
law. 

The final task of the Shaggy Steed of Physics will be to reveal that 
the rigid husk of the law of motion contains unexpected seeds of freedom 
within. The law of motion inherently gives rise to behavior that is, from a 
mechanical point of view, unpredictably open. This discovery in which the 
determinism and constraint of the law of motion open outward to creative 
diversity is a profound development in mechanics. In it, physical law is 
at one with music and poetry: a rigid rule of rhyme or rhythm provides a 
ground for creative inspiration at precisely those points where it is broken. 

The law of motion is not a static statute. The law of motion is a sub- 
lime mathematical fabric reflecting the creation. Its unfolding implications 
of cosmos and chaos to this day surprise and illuminate us. It has been 
subject to the epochal evolution from classical mechanics to quantum me- 
chanics while at the same time it possesses a seamless continuity flowing 
back through Feynman and Dirac to Hamilton and Lagrange, to Newton, 
to Galileo and Kepler, and ultimately to the Pythagoreans. 

The structure of the world revealed by the mechanics birthed by Kepler, 
Galileo, and Newton is physical, material. The physical world is illumi- 
nated by mathematics. Mathematical physics grasps the motions of the 
heavens and the elements through bodies with properties such as mass and 
charge that interact with forces that depend in a geometrical manner on 
the motions of the bodies. Yet it partakes of the same divine attributes 
which prompted the celebration of the heavenly spheres: beauty and truth. 

Physics has come to reveal the unity of the richness, diversity, and com- 
plexity of the natural world by piercing through these things for the kernel 
of explanation. Physics grasps the entire natural world with what seems 
precious little: the motions of a small number of elementary particles that 
interact by means of an equally small number of forces. In the case of 
the elements and the large-scale motions of the solar system, two kinds of 
particles (electrons and nuclei) bearing two properties (mass and charge) 
corresponding to two forces (gravity and electricity) suffice. 

This is one side of the mystery. The other side is that everything 
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material — all the elements and molecules, the animal, vegetable, and min- 
eral world, and the motions of solar systems — spring from the interaction 
of these particles and forces. And in that the physical world is revealed in 
a mystical light: an abundant, interleaved richness of space and matter — in 
stars, planets, waters, earth, and living creatures — all made of the same 
elementary particles blooming riotously out of a simple law which itself 
dissolves in freedom. It is a unity more wondrous than that of the celes- 
tial spheres. And one fully spiritual. For when one renders the fullness of 
respect to the truth in the material world, matter becomes luminous with 
spirituality, revealing the mystery of the Creator in a way both more im- 
manent and more abundant than that which held the imagination of the 
Greek and medieval mind. 




Chapter 3 



The Law of Motion 



The motion of the universe is not assigned a fixed score. Nature orchestrates 
motion in a far more graceful manner with the action principle. According 
to the action principle, the motion of the universe is free save for this 
requirement: in their movement between any two points in space and time, 
particles of matter take, from the infinite number of possible paths, just 
those for which their action has the least possible value. From this simple 
principle come the planetary orbits of heavenly bodies and the structure of 
chemical elements. 

As we progress on this journey through mechanics we shall encounter 
the many faces of this mechanical quantity in nature called action. And we 
shall draw out the implications of the overarching principle, action shall be 
a minimum, for the heavens and the elements. Ultimately we shall find the 
action and the law of motion things of astonishing simplicity: the many 
faces of the action are measures of the lengths of paths the objects of the 
world trace out in their motion; the law of motion the simple requirement 
that these paths are always the shortest routes possible between any two 
points. 



The Warp and Weft of Motion 

The primitive strands of experience out of which motion is woven are 
space and time. It is with these elementary perceptions that one begins the 
ascent to the principle of least action. 

At the human scale it takes three independent pieces of information to 
pin down a location in space. The location of a point within a multi-story 
building requires three numbers: specification of the floor of interest and 
two distances from any two sides of the building to a point on that floor. 
The location of an aircraft in flight requires three numbers: a longitude, a 
latitude, and an altitude. Our spatial world is a three-dimensional world. 

Why three? No one knows why the human-scale world is a three-world. 
A world of only one or two dimensions would be too constricted for any 
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rich texture in creation; a world of many dimensions too free, too densely 
filiated. Perhaps a world with more than one or two dimensions (but not 
vastly many more) provides for a maximum diversity of form and structure 
without degenerating into a tangle of monotonous chaos. 

To reckon motion one needs a basis for the description of space — a ref- 
erence frame in which one may locate and describe the positions of the 
objects of the world. Three independent lengths xi, X2, X3 may be used to 
single out a point in space. The triplets (xi, X2, £3) are said to constitute a 
manifold. A manifold is the set of all points where particles in motion may 
be located. 

A manifold of points is still a formless space. To give form to the mani- 
fold, one may specify that the lengths be laid out along mutually perpen- 
dicular directions. To be perpendicular, the line from the origin of the 
coordinates to the point singled out (denoted x in Fig. 3-1) must depend in 
a particular way upon the three lengths xi, X2, X3 that specify that point. 
The form of that dependence and the meaning of perpendicular go hand in 
hand. 

The specification of a point by mutually perpendicular coordinate axes 
and an ancient geometrical object — the triangle — are intimately connected 
in the reckoning of space. Any of the three lines xi, X2, X3, taken in pairs, 
form two sides of a triangle. These lines are perpendicular when the triangle 
they form has a third side given by the rule of Pythagoras, the square of the 
third side is the sum of the squares of the two adjacent sides. If the lines of 
length xi and X2 are the two adjacent sides, the length l of the third side 
of this triangle is given by the Pythagorean rule as l 2 = x \ 2 + X2 2 as shown 
in Fig. 3-1. 

Now consider the line of length l and the coordinate line of length X3 
which are also adjacent sides of a triangle. The third side of this triangle 
is the line of length x. By invoking the Pythagorean principle again, one 
finds x 2 = l 2 + X3 2 and the fundamental rule of Euclidean space, 

x 2 = xi 2 + x 2 2 + X3 2 . (3.1) 

The specification of the manner in which the distance between points in 
space is related to the component lengths that define those points is called 
the metric of the space. The metric endows three independent coordinates 
with an internal structure. In the case of the metric (3.1) the manifold has 
Euclidean structure. 

The position of a point in space is specified by three lengths structured by 
the Euclidean metric (3.1). Such a threefold quantity obeying the Euclidean 
metric is a position vector specifying points in space and is symbolized as 

x = (xi,x 2 ,x 3 ), 

where xi, X2, and x 3 are the three orthogonal lengths that specify the direc- 
tion and length of the vector. The length of the vector x is its magnitude 
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Figure 3-1. The Euclidean Description of Space. A point in space is specified by a 
vector x whose components are equal in number to the three dimensions of the space. 
The length of the vector x is x and may be computed in terms of its component lengths 
by first computing the side length l of the triangle formed by xi and X 2 • The triangle 
formed by the sides of length l and £3 then has a side x given by the Pythagorean law 
as x 2 = x\ 2 + X2 2 + X3 2 . 

and is symbolized as x or sometimes as \x\. In Euclidean space the square 
of the length of a vector is equal to the sum of the squares of its component 
lengths. 

We have established a reckoning of space; but ours is a world in motion. 
The objects one identifies at particular locations in space do not remain 
there. They change their positions. Another coordinate beyond those that 
mark off the spatial location of objects is called into being. Time is this 
coordinate; but it is important to note that it is our experience of change 
which prompts our sense of time. 

We shall join Newton and Galileo in the seemingly unquestionable as- 
sumption that the flow of time is absolute. Time flows in a clock or hour- 
glass independently of the objects one locates in space. Likewise, time does 
not depend upon the spatial location of the clock in which it is recorded. 
As a result of that assumption, space and time do not intertwine in the 
measurement of distance. The three spatial coordinates plus the time coor- 
dinate constitute four coordinates. The three spatial coordinates are united 
by the Euclidean metric; but the time coordinate is aloof from this union. 
Time goes its own way. In the mechanics bequeathed to us by Galileo and 
Newton the time t at which a measurement of position is made does not 
directly enter into the calculation of length of the position vector accord- 
ing to Eq. (3.1). Only the three spatial components xi, X 2 , £3 enter into 
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the computation. The four coordinates consisting of space and time are 
regarded as three spatial coordinates which are a function of time. 

There is no guarantee that the space of the physical world is Euclidean 
or that the flow of time is absolute. The conception of space and time is 
a product of human imagination. Our concrete experience with triangles 
shapes the imagery of Euclidean space; and the world conforms to Euclidean 
space quite well under ordinary, earth-bound conditions. But the space of 
the world need not be Euclidean under conditions beyond the human-scale 
experiences in which it is grounded. 

The assumption of the Euclidean structure of the physical world was 
never explicitly acknowledged until Henri Poincare, Albert Einstein, and 
others questioned it in the early part of this century. Einstein showed it to 
be arbitrary, a limiting form of a more fundamental geometry and metric 
valid at speeds small compared to the speed of light. Since the motion in 
both the solar system and the atom takes place at a small fraction of the 
speed of light, the ancient rule shall guide our journey through the heavens 
and the elements. Space is Euclidean. Its structure is that of a metric 
inspired by the Pythagorean triangle rule. 



Mechanical Genesis 

Outside of time there is only one vector — position in space. With time, 
the position vector evolves and changes. It becomes a function of time 
x = x(t). In a world of time as well as space a second elementary vector 
exists. This is the velocity, x = dx/dt. The primal objects of motion are 
the two elementary vectors, position and velocity. 

With the existence of a second elementary vector, interaction with the 
first becomes possible. The two elementary vectors may be combined to 
generate a family of mechanical quantities. The simplest is the projection 
of one vector onto the other. This projection yields a single quantity from 
two three-component quantities. This quantity is the scalar product. A 
vector is a three-component object but a scalar is a single quantity. For 
any two vectors, for example x and x, the scalar product is symbolized as 
x • x and formed as a sum of products of the components which produce it 
as 

X • X = XiX i = X\X\ + X2X2 + xMs- 
i 

If the two vectors are perpendicular, the scalar product is zero. If both 
vectors are identical, the scalar product produces the square magnitude of 
the vector, 

2 1 2 1 2 

x • x = X\ + X2 + X3 . 

Albert Einstein introduced a useful notation for indicating the summations 
that occur in operations with products of vectors. In Einstein’s convention 
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a repeated subscript implies summation over the range of the subscript. 
Thus, the scalar product x • x = ]T\ x i&i can be compactly written in 
subscript notation without the summation symbol as 



Xi±i = X\X\ + X 2 X 2 + X3X3. 



When the subscripts are distinct, as in XiXj, no summation is to be per- 
formed. When the subscripts are the same, the products are summed. 
Note that when the subscripts are distinct, their identity must be pre- 
served. But ouce a scalar product is indicated by making the subscripts 
identical, the subscript pair indicating the sum can be changed at will, 

A plane can be defined by a single vector. It is that plane in which all 
lines perpendicular to that vector and intersecting it at some point must lie. 
But in three-dimensional space a plane can also be defined by two vectors 
as the common plane in which both lie. Thus, the existence of two vectors 
in three-dimensional space naturally implies the existence of a third vector 
perpendicular to each of them. This vector is their vector product. 

The vector product of two vectors such as x and x (sometimes called the 
cross-product) is itself a vector with three components and is symbolized 
as x x x. The three components of the vector product are antisymmetric 
pairs of the components of the vectors that produce it: 



(x x x) 1 = X2X3 - X3X2, 

(x x x ) 2 = X3X1 - X1X3, 

(x X x ) 3 = X\±2 — X 2 Xl- 

The components of the vector product are structurally identical. Any com- 
ponent may be obtained from a previous component by advancing all sub- 
scripts by 1 in the cyclic order 123123 .... A component of the vector 
product in a given direction does not contain any of the components of its 
constituents in that direction. This is the manifestation of the orthogonal- 
ity of two vectors with respect to their vector product. The antisymmetric 
pairing of component products requires that the vector product of two vec- 
tors which are parallel vanish; for if two vectors such as x and x are parallel, 
their components are fixed ratios of one another, X\fx\ = X2/X2 = X3/X3. 
For the same reason, the vector product of a vector with itself is identically 
zero. If the order of the terms is reversed in a vector product, the vector 
product changes sign, x x x = — x x x. 

The vector product may be compactly expressed in subscript notation 
utilizing the alternating tensor eijk as 



(x X x)j = CijkXjXk, 



i,3,k = 1,2,3. 
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The alternating tensor is a simple object.* Its components take only the 
values 0, 1, or —1. If two or more of the ijk are identical, then = 0. 
If the ijk “wind forward” in cyclic order 123, 231, 312, then = 1. If 
the ijk “wind backward” in anti-cyclic order 321, 213, 132, then e^-fc = 
— 1. Einstein’s convention for repeated indices implies summation over the 
indices j and k in the vector product CijkXjXk so that, for example, one has 
(x x x)i = eijkXjXk = x 2 x 3 - x 3 X 2 - 

What kind of dynamical structure naturally arises when one combines 
the two primal vectors x and x? The products x x x and x x x both vanish 
while the product x • x is just one-half the time derivative of x • x. This 
leaves only three unique products which may be formed from position and 
velocity: 

x • x, xxx, x x. 

The first is the square magnitude of the position vector itself which has 
already been described. The second is a fundamental quantity in the de- 
scription of motion. Multiplied by the particle mass m, it is the angular 
momentum J = x x mx. Because the angular momentum is a vector 
product, it is perpendicular to both the position and velocity vectors. The 
angular momentum vector defines a plane in which the position and velocity 
vectors lie. 

The last product is the square magnitude of the velocity vector. Mul- 
tiplied by one-half the mass rri of the particle, it has come to be known 
as the kinetic energy, T = mar/2. These dynamical quantities may also 
be expressed directly in terms of the particle linear momentum p = mx 
instead of the velocity as J = x x p and T = p 2 /2m. 

Three components of position x, three components of linear momentum 
p, three components of angular momentum J, and the kinetic energy T 
are the fundamental objects created by the motion of a particle. Motion 
in three-dimensional space naturally gives rise to these ten (each vector 
contains three) dynamical quantities. We shall find this tenfold set of dy- 
namical quantities appearing again and again in the unfolding story of 
motion. 

We now have the elementary vocabulary of mechanics. We have yet 
to speak of the law of motion. Yet much of the dynamical structure of 
motion — linear momentum, angular momentum, and energy — is naturally 
implied by the position and velocity vectors and their composition. 



* The alternating tensor components are explicitly 



/0 0 0 \ / 0 0 - 1 \ / 0 1 0 \ 

£1 jk = ( 0 0 1 ) , £2 jk = ( 0 0 0 I , £3 jk = ( — 1 0 0 I 

Vo -10/ Vi 0 0/ V 0 0 0 / 
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A reference frame from which one may observe and describe the motion 
of the world consists of space and time coordinates (. x,t ). But the world 
exists quite independently of any reference frame. The reference frame is 
our creation — and there are arbitrarily many of them from which we may 
observe any motion. The reference frame is essential for observation of 
the world. Without it one sees nothing with certainty. Yet what one sees 
should not be an artifact of the reference frame. 

Motion presents a formidable challenge to this principle; for the dynamics 
which one experiences as well as the mathematical form of the law of motion 
depend upon the frame of reference in which one observes the motion. One 
may plant one’s reference frame on the moon, on a ship moving on the 
ocean, on a spinning merry-go-round, or on a soaring bird. Moreover, 
one may keep time on different clocks in each of these reference frames. 
The motion which is observed in all these reference frames is governed by a 
universal law. But that law will not exhibit a universal form when expressed 
in the coordinates (x,f)i, (x,f) 2, ... of each of these reference frames. 
Although the law of motion is universal, its form in various reference frames 
is not. 

Is there a unique reference frame that is truly universal, one from which 
the motions in all other reference frames may be reckoned? The nineteenth 
century physicist and philosopher Ernst Mach suggested that such a refer- 
ence frame was given by nature. He proposed that the universal reference 
frame was one in which the “fixed stars” were at rest. We now know the 
stars are in violent motion. So Mach’s proposal does not provide us a 
naturally given reference frame in which the law of motion will reveal its 
universal form. 

Since nature does not provide a universal reference frame attached to 
natural objects, one must proceed indirectly. One must conceive such a 
reference frame by incorporating just those properties of motion that are 
universal in all reference frames while avoiding those that are unique to 
particular frames. This was the approach followed by Isaac Newton who 
conceived of such a universal reference frame as an “absolute space,” — a 
blank canvas stretched across space and time devoid of all local references 
to motion. It is upon this blank mathematical canvas that one invites 
nature to paint the law of motion. 

Peculiarly local properties of reference frames are their origins and their 
directions. One therefore blanks out the canvas of space-time with a wash 
of symmetry that purges it of all unique points of origin and all unique lines 
of direction. The symmetry of origins is that there is no unique point in 
either space or time from which motion must be reckoned. No point in the 
universe or in time has a special status over any other. No point on the 
canvas of motion can be painted in as a unique origin. 

Direction is a more subtle notion; for there appear to be two ways in 
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Figure 3-2. Galilean Relativity. The mathematical form of the law of motion is identical 
in all reference frames displaced from one another, uniformly translating with respect to 
one another, and rotated with respect to one another. 



which it may be defined. Both arise out of experience. Move adamantly 
forward and one establishes a translationally-based direction fixed by one’s 
velocity. Point at a star and one establishes a rotationally-based direction 
fixed by the angle one’s arm makes with the horizon. In one case a veloc- 
ity specifies a direction; in the other an angle specifies direction. In the 
mechanics that came into being through Galileo and Newton both kinds 
of directional symmetry, one translational and the other rotational, are in- 
cluded. Symmetry with respect to directions means that there is neither a 
unique velocity nor unique rotational orientation from which motion must 
be reckoned. No unique line of direction can be drawn on the canvas of 
motion. 

The symmetry of the law of motion with respect to origins, translational 
directions, and rotational orientations means that it has the same mathe- 
matical form in all reference frames that have been translated and rotated 
with respect to one another. Similarly, the law of motion is the same when 
expressed in the times of all clocks whose time origins have been translated 
with respect to one another. 

The space and time in which the law of motion is independent of trans- 
lations of the space or time coordinates is said to be homogeneous. The 
space in which the law of motion is independent of rotations of the coordi- 
nates is said to be isotropic , after the Greek prefix iao meaning “equal” and 
rponog meaning “rotation.” Reference frames in which the law of motion 
reveals its universal form are homogeneous (there is no preferred origin) 
and isotropic (there is no preferred direction) as indicated in Fig. 3-2. 

The reference frames in which the law of motion takes its universal form 
are not attached to any particular object in nature (such as the earth 
or a star) but are rather created from the insight that homogeneity and 
isotropy are the universal symmetries underlying all motion. Reference 
frames in which the law of motion has its universal form are homogeneous 
and isotropic. Nature affirms this insight by presenting us with a pro- 
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Figure 3-3. Rotation and the Vector Product. The vector <5 fl X x is perpendicular to both 
Sfl and x. For infinitesimal rotations Sfl, the sum x + 517 X x has the same magnitude 
as x but its direction has been changed by a rotation through an angle Sfl. 



foundly simple law of motion in all reference frames that partake of these 
elementary symmetries. 

The requirements of homogeneity and isotropy may be expressed in the 
statement that the law of motion has the same form in all reference frames 
(x,f) and ( x',t ') that differ in translational displacement and rotational 
orientation. It is both simple and sufficient to state this condition for ref- 
erence frames that differ infinitesimally: those displaced and rotated by 
infinitesimal distances 5a, times St, velocities 5u, and angles Sfl. Homo- 
geneous and isotropic reference frames are related by the transformations 



x' = x T 6a T Sut, + Sfl x x, 
t' = t + St. 



(3.2) 



The law of motion has precisely the same form whether expressed in the co- 
ordinates (x,t) of one reference frame or the coordinates (x',t') of another 
related by the transformations (3.2). 

In the transformations (3.2) that endow reference frames with homogene- 
ity and isotropy, 5a and St represent shifts in the origins of space and time. 
The velocity Su represents a uniform translation of the space coordinates. 
The rotational transformation vector Sfl is a constant vector about which 
the coordinates are rotated. Since Sfl x x is perpendicular to both x and 
Sfl, the sum x + Sfl x x results in a rotation of x about the axis of Sfl 
as shown in Fig. 3-3. The infinitesimal vector product therefore represents 
a rotation. The direction of S fl is along the axis about which the rotation 
takes place in the pointing sense of a right-hand screw. For infinitesimal 
rotations the magnitude Sfl represents the angle of rotation about its axis 
and the sum x + Sfl x x has the same magnitude as x. Only the direction 
changes. For finite fl, the transformation x + fl x x is not a pure rotation 
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because the vector product is not only a rotation of the original vector; its 
magnitude is also changed.* 

The law of motion is the same in all reference frames differing by uniform 
shifts of their origins, uniformly translating at a constant velocity with 
respect to one another, rotated through a constant angle with respect to 
one another, and whose clocks are shifted in time by a constant amount 
from one another. The group of all transformations obeying Eqs. (3.2) is 
called the Galilean relativity group.) 

Ten symmetry parameters (each vector again contains three) describe 
the Galilean group. They are the space origin shift a, the space trans- 
lation u, the space rotation 17, and the time origin shift r. The blank 
canvas of mechanics can be thought of as a ground of motion formed of 
the most primitive, wide-ranging symmetries — an openness to all origins, 
all translational directions, and all rotational orientations. 



Interaction 

The Western vision of the physical world gives precedence to the whole 
as made of elemental parts. In this vision of mechanics, interaction is 
a fundamental feature of the natural world; for if the whole is made of 
elemental parts, the character of the whole arises out of the way in which 
the elemental parts interact. The interaction of elemental matter is the 
glorious drama in which the world is rendered. 

We call the interactions between particles of matter “forces.” The two 
known large-scale interactions of matter are those of gravity and electricity. 
How can this interaction be represented? At this point one stands at a 
great juncture in human imagination. On the one hand one can attempt 
to describe gravity and electricity as entities in themselves generated by 
properties of particles that are their raison d ’ etre : mass for gravity, charge 
for electricity. The interaction between material particles is then described 
in terms of forces which the particles exert upon one another by virtue of 
their mass and charge. The force fields may be computed for an ensemble 
of particles and inserted into a law of motion to find their effect upon the 
movement of the particles. This is the approach of classical mechanics — the 
mechanics first grasped by Isaac Newton and elaborated by Joseph-Louis 
Lagrange, C. G. J. Jacobi, and William Rowan Hamilton. 

On the other hand one can adopt a truly geometrical point of view and 
insist that force is an artifact of Euclidean geometry. In this view mass 



* Though it is not required here, a rotation through finite fl can be obtained in 
an almost magical manner. One exponentiates the infinitesimal transformation x 1 = 
x r + eijkStij '. Efc to obtain the finite rotation a? = 

f This term recognizes Galileo’s insight that bodies in motion with uniform velocity are 
free of forces (later codified in Newton’s first law of motion) and seems to have originated 
in P Frank, Ber. Akad. Wiss. Wien. Ila, 118 , 382 ,1909. 
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and charge do not create forces. They create geometry. Mass and charge 
induce a curvature in space which invalidates the Euclidean law (3.1). A 
new metric of space shaped by the presence of mass and charge rather than 
the Euclidean metric becomes the basis of the law of motion. This was 
Einstein’s way; and it is essential for motions with velocities of the order 
of the speed of light. Since the heavenly motions of our solar system and 
the microscopic motions of the electron in the atom take place at velocities 
only a small fraction of the speed of light, we shall follow the Newtonian 
approach. 

In the Newtonian picture mass and charge fill space with potential en- 
ergy. The gravitational potential energy V of two masses m± and m 2 located 
at the points xi and x 2 is directly proportional to the masses of the bodies 
and inversely proportional to the distance of separation |x 2 — xi| between 
them, 

_ mi?n 2 
|x 2 -xi|' 

The force f acting on a particle whose position vector is x and described 
by a potential energy V (x) is 

f = - W. 



The potential V is a scalar; but its derivative with respect to the vector x 
is a vector W = dV/dx called the gradient, 

W = {dV/dxudV/dxi, dV/dx 3 ). 



The force field that two gravitating masses create through the potential 
V = — mim 2 /|x 2 — xi| is an attractive inverse-square field first proposed 
by Isaac Newton. The force acting on the particle at x\ is f = —dV/dxi 



or 



?TIiTO 2 / x 2 - Xi \ 
|x 2 - Xil 2 \ |x 2 - Xl \) 



(3.3) 



Each of the parts of the force law may be read off to reveal a significant 
property of the gravitational force. The first factor on the right of Eq. (3.3) 
shows that the gravitational force is an inverse-square force. The force is 
proportional to the inverse square of the distance of separation between 
the particles. The negative sign shows that this force is attractive. The 
second factor is a vector of unit length directed from the center of mi to 
the center of m 2 . It shows that the gravitational force is a central force 
directed along the line of centers between the two particles. This follows 
from the fact that the potential energy field, consistent with the isotropy 
of space, depends only upon the magnitude of the distance of separation 
between the particles |xi — x 2 |. 

The ranges of the gravitational and electrical force are vastly different 
(the gravitational force between two electrons is forty-two orders of magni- 
tude smaller than the electrical force between them). Nonetheless the po- 
tential energy and force fields of gravity and electricity (for which charges 
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move slowly compared to the speed of light) are structurally identical. The 
potential energy V and force f for a system of charges are precisely the 
same as those of gravity with mass m replaced by charge e. Force outside 
the scale of the nucleus is inverse-square. The potential energy and force 
for either field may be expressed in the form 



V = -- 



f = —k 



\xi - x 2 \ 



(xi - x 2 ) 
|x 1 -x 2 |3 



(3.4) 



where for gravity k = m\m 2 and for electricity k = —e\e 2 . 

For an ensemble of particles of masses m Q , a = 1,2,... interacting with 
gravitational forces, the potential energy function is 



^ = -EE 



/3<a 



T'czTft'P 

« - x p\' 



The condition f3 < a is required so that interactions are not counted twice. 
The potential energy of a system of slowly moving charges interacting with 
electrical forces is the same as that for gravity with charges e a replacing 
the masses m a . 

The interaction between particles described by their force and energy 
fields must be consistent with the underlying symmetries of space and time 
given by Eqs. (3.2). This means the potential energy field must be the 
same in any two reference frames where the particle coordinates (x' a , t') 
and (x Q , t) are related by the homogeneous and isotropic transformations 
(3.2), 

V(x' a ,t') = V(x cn t). (3.5) 

It is again sufficient to verify this symmetry condition for infinitesimal dif- 
ferences between reference frames for which one expands V(x , a ,t l ) about 
V (x a , t) so that the condition (3.5) becomes 

£sy (x "- x “ ) + w (f ~‘ ) = 0 - 

a 



One now makes this transformation homogeneous and isotropic by express- 
ing (x' a ,t') in terms of (x a ,t) from Eqs. (3.2) for infinitesimal transforma- 
tions generated by <5a, Su, 6 ft, and St with the result 



E 



dV 

dx a 



(5a + Sut) + ^(x a x 

a 



dV 

dx a 



)-sn + 



dV 

dt 



St = 0 , 



where use has been made of the identity dV/dx a ■ (x Q x Sfl) = (x a x 
dV / dx a ) ■ Sfl. Since the symmetry parameter variations are independent 
and need not vanish, the potential energy fields of Galilean relativity satisfy 
the symmetry conditions 



E 



dV 

dx a 



= 0, 



^2 x a X 



dV 

dx n 



= 0 , 



dV 

dt. 



= 0. 



(3.6) 
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The first of the symmetry conditions (3.6) indicates that the sum of the 
forces acting on an ensemble of particles vanishes while the second shows 
that the total moment of the entire ensemble also vanishes. For the isolated 
interaction between any two particles, these conditions are contained in 
Newton’s third law of motion: the force exerted by the first particle on the 
second is equal and opposite to the force exerted by the second on the first 
and the force between the two particles is directed along the line of centers 
that connects them. 

It is a fundamental hypothesis of mechanics, verified down to the sub- 
nuclear level, that the symmetry between any pair of particles is preserved 
in the ensemble. Hence, all forces in nature are central forces and cancel 
in pairs in the sums in Eq. (3.6). Any potential V(x a ) which depends only 
upon the magnitudes of the distances between particles (for which gravity 
and electricity are paradigms) will satisfy this condition. 

The third of the symmetry conditions (3.6) shows that force and po- 
tential energy fields in Galilean relativity are independent of time. The 
interaction between particles is instantaneous. It can be seen from the 
forms of the force and potential energy (3.4) that any change in the posi- 
tion of one of the particles is instantly reflected in a change in the force 
acting on the other. The propagation of information about the change of 
a particle’s position is infinitely fast. The infinite speed of propagation of 
forces is a hallmark of Galilean space-time. 



The Gene of Motion 

The array of particle trajectories and their momenta, the shape, pat- 
tern, and structure these trajectories create in space and time — in sum, the 
whole nature of a given motion — issues from a single mechanical quantity. 
We shall work up to this quantity — the gene of motion— by first gathering 
together the properties of an ensemble of particles as a whole. The entire 
ensemble has a total mass m and a center of mass X defined as 

m = m a , X = to -1 Ym a x a , 



where the sum is over all the particles of the ensemble. The total linear mo- 
mentum P and the total angular momentum J are also a sum of individual 
particle contributions: 

P = ^p Q , J^x„xp Q . 



Since p a = m a x. a , it follows that the total momentum can be expressed 
directly in terms of the center of mass velocity by P = mX. In the event 
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that the total momentum P is a constant, this equation may be integrated 
to yield 

N = Pt — mX , 

where N is the constant of integration. The vector N fixes the motion of 
the center of mass of an ensemble whose total momentum P is uniform. 
It can be expressed in terms of the center of mass at some fixed time, for 
example, as N = — mXo where Xo is the center of mass at time t = 0. We 
shall call N the mass- center (which is to be distinguished from the center 
of mass X). 

The total kinetic energy is similarly a sum over particle contributions: 
T=Y^ r n»x 2 a /2 = ^2p 2 a /2m a . 



The sum of the kinetic and potential energy is the total energy H of a 
system of particles, 

H = T + V. 

The difference between the kinetic and potential energy is known as the 
Lagrangian 

L = T —V. 

The Lagrangian is the gene of motion. All the dynamical quantities 
describing motion may be shown to be descended from the Lagrangian. 
The momentum of a particle p a is the derivative of the Lagrangian with 
respect to velocity, p a = dL/dx a . The angular momentum of a particle 
J a may also be unfolded from the Lagrangian as J a = x a x d L/dx a . The 
force acting on a particle likewise derives from the Lagrangian as d L/dx a 
and the moment as x Q x dL/dx a . Consistent with the potential energy 
condition given by the first and second of Eqs. (3.6), the Lagrangian of an 
isolated ensemble of particles satisfies the conditions 



E 



dL 

dx a 



= 0 , 




dL n 

x b — = °> 
ox n 



(3.7) 



where the sums are over all the particles. 

All mechanical quantities turn out to be expressible in terms of the La- 
grangian and its derivatives with respect to two quantities, position and 
velocity. This is not by accident. It is a deep-seated principle of nature 
that a state of motion is completely specified by a twofold set of conjugate 
coordinates such as the positions and velocities of the particles. The co- 
ordinates of motion are said to be conjugate because they come in pairs 
and there exists a conjugal relationship between the two coordinates that 
constitute the pair. In the case of position and velocity, one coordinate is 
the time derivative of the other. 
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Everything one could want to know about a system of classical point 
particles is obtainable from the pair of conjugate coordinates. This need 
not have been the case; nature might have in addition to x a and x a utilized 
the accelerations of the particles x Q as well as even higher derivatives to fix 
a state of motion. But the world is so constituted that the acceleration and 
all the higher derivatives of a particle’s position are expressible in terms 
of a conjugate pair of coordinates such as x a and x a — and no more. The 
manifold of the motion of n bodies is a 6n-dimensional collection of positions 
and velocities. It is with these coordinate pairs that one must build the 
Lagrangian L = L(x ai x a ) from which all mechanical quantities flow. 

The total linear momentum P and angular momentum J may be ex- 
pressed in terms of the Lagrangian as 



ot 



dL 
dx Q ' 



J = ^2 x a x 

Ot 



dL 
dx a ' 



The mass-center N follows from the Lagrangian as 



ot 



-t-mX. 

dx a 



The kinetic energy is a homogeneous function of the velocities. It can there- 
fore be expressed in terms of derivatives of itself with respect to velocity 
as 

T 1 \ ' . 2 1 \ ' dT . 

Ot ot 

Since L = T — V and V is independent of x a , it follows that dT/dx a = 
d L/dx a and the kinetic energy is given by the Lagrangian as 




• Xa- 



The total energy may be written in T and L rather than T and V as 
H = 2 T — L. With the kinetic energy also expressed in terms of the 
Lagrangian according to the above forms, the total energy is obtained from 
the Lagrangian as 

H = ^2(dL/dx a ) ■ x a - L 

Ot 

or 

H = p a ■ x a - L. 

Ot 

The mechanical quantities P, IV, J, and H all issue from the Lagrangian. 
These quantities describe the ensemble as a whole and are ten in number. 
These ten global quantities are united with the ten symmetry parameters 
of Eqs. (3.2) by the action principle. 
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The Action Principle 



The action is the grand quantity in the orchestration of motion; and it 
too is built from the Lagrangian. The action embraces all the particles 
of the motion in an integral of the Lagrangian over the time path of the 
ensemble, 

S = I Ldt. (3.8) 

Jt-L 

The Lagrangian — the gene of motion — plays the central role in the action 
and the resulting law of motion. The action is its integral and the principle 
of least action is simply this: of all the possible paths the particles may take 
between any two points in space and time, they take those paths for which 
the action S has the least possible value. All other equations of motion 
including Newton’s celebrated law follow from this simple statement. 

One might well ask why, of the various mechanical quantities, nature 
chooses the difference between the kinetic and potential energies as the 
Lagrangian? The best answer one can give to this question in Galilean 
space-time is that in the continual interchange of kinetic and potential 
energy, nature seeks to minimize any deviation between the two. However, 
in the relativistic space-time of Einstein, least action and the trajectories 
to which it leads have a deep and beautiful geometric significance. We shall 
see the geometric significance of the action principle in Chapter 6. 

The action involves all the particles of the ensemble. It is useful therefore 
to make a natural extension in notation that describes the ensemble as a 
whole. The configuration of all the particles may be given by specifying the 
rectangular vector components x a . But one may also use other systems of 
coordinates such as spherical-polar and other coordinate systems involving 
angles and lengths. The essential idea is that one must provide s = 3n 
configuration coordinates to fix the positions of n particles. The ensemble 
is said to have s degrees of freedom. The generalized coordinates which fix 
the configuration of the particles therefore form an s-dimensional vector,* 

9 = (9i,92,*--,9s)- 

In the case in which the generalized coordinates are indeed rectangular, the 
components of q are just the components of the x a with the particles taken 
in a fixed order. 

The velocity vector corresponding to the position vector is 

9 = ( 91 , 92, • ■•,9s)- 

The Lagrangian L = L(x a ,x a ) becomes L = L{q,q) and the generalized 
momenta are 

P = dL/dq = (pi,P2, ■ ■ ■ ,Ps)- 

* Generalized coordinate vectors are written in latin script with boldface reserved for 
three- vectors in Euclidean space. 
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One now regards all mechanical quantities as having s components arranged 
in this fashion. These vectors also have a subscript notation such as qi,Pi, 
and follow the Einstein summation convention in the formation of scalar 
products where the sums now run over i = 1, 2, , . . , s. 

The advantage of generalized coordinates is that most expressions de- 
scribing the ensemble have the same form as those for a single particle. A 
typical quantity like the total energy H = p a ■ x a — L(x a ,x a ) may be 
written simply 

H(q,p) = Pifi - L{q 1 q). 

(Note, however, the exception of the vector product which is only defined 
for three-vectors, J = J2 a x a x p a .) 

Least action can be illustrated with the familiar image of a valley. At the 
minimum point of any function “at the bottom of a valley” the slope — the 
first derivative — vanishes. For a function of several variables, all the first 
partial derivatives vanish at a minimum point. 

Another way of stating the condition of vanishing slope is that the vari- 
ation of a function must vanish at a minimum point. The variation of a 
function f(q) is given by 

Af ^ A 
of = x-m. 

The variables qi upon which / depends are said to be its arguments. The 
Sqi are the variations of the arguments. The variation of a function consists 
of a sum of the variations of its arguments with the first partial derivatives 
appearing as the coefficients of the argument variations. The variation of 
a function shows how small changes in the arguments feed into changes of 
the function. If a partial derivative with respect to a given argument qi 
vanishes, the function suffers no variation from that argument no matter 
what the argument variation. Since the variations of the arguments are 
arbitrary, a vanishing variation Sq necessarily requires that every partial 
derivative df /dqi vanish at a minimum point. Most points of a function 
are not maximum or minimum points. The maximum or minimum points 
of a function can only occur at those special points where the first partial 
derivatives vanish. 

Least action requires a vanishing variation. But in the case of the action, 
one is not looking for a single point where the action variation vanishes; 
one is looking for the law governing whole functions q{t) for which the ac- 
tion variation vanishes. It is the embrace of whole functions in least action 
that gives the principle its power and sweep. A law of motion that sim- 
ply required a function to have a maximum or minimum at some point 
in space and time would not be a universal law of motion. The action is 
therefore said to be a functional , a quantity that has a single value corre- 
sponding to an entire function. It is customary to symbolize the variation 
of a functional such as S as SS. Where simple maximization or minimiza- 
tion produces points, action minimization produces a rule governing entire 
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sets of functions q(t), the trajectories of the motion. This rule is the law of 
motion. 

The determination of the general variation of the action S with La- 
grangian L and the law that the functions q{t) must satisfy for a vanishing 
action variation are the central achievements of the variational calculus 
whose development in mathematics was spurred by mechanics. As the ex- 
ample of a minimum point at the bottom of a valley illustrates, the variation 
of the action will be given by a sum of the variations of its arguments with 
the first partial derivatives of the action appearing as the coefficients. We 
have seen that nature fixes a state of motion by the positions and velocities 
of the particles in the Lagrangian L = L{q,q). There are therefore two 
sets of arguments, the q and the q , and two sets of derivative coefficients, 
the Lagrangian derivatives dL/dq and dL/dq. The action variation that 
results when the paths are given variations Sq while still passing through 
fixed end points at (q,t) i and (q,t) 2 is 



SS= / 6Ldt = 

Jti 



r t2 '8L . dL . . 

-Sq, + ^Sq, dt. 



dq. 



dq. 



(3.9) 



A problem now arises. There are two different variations, Sq and Sq, 
so the two Lagrangian derivative terms are not on the same footing. To 
proceed further one must have them on the same footing. This may be 
done by transforming the term in Sq into one in Sq through integration by 
parts, 

dL . . 8L d . 

= — —Sqi 

OQi UQ.i dt 



d_ 

dt 




d_ 

dt 




Sq,. 



The first term on the second line, d/dt[{dL/dqi)Sqi], may be immediately 
integrated and expressed in terms of its values at the end points ( Sq , t) 1 and 
{Sq,t) 2 - Since the path variations Sq vanish at the end points, this term 
vanishes. One therefore finds an important equivalence for path variations 
that vanish at the endpoints: 




dt 

UQi 




d dL 
dt dq i 



Sqi dt. 



The action variation due to the path variations thus reduces to 



SS=- 




( d_ dL 

V dt. dq i 




(3.10) 



Equation (3.10) contains the contribution to the action variation result- 
ing from path variations. But particle paths are not the only arguments 
that generate variations of the action. The action and the Lagrangian also 
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embody the underlying symmetries of motion. They must be invariant to 
origin shifts and translational and rotational orientation shifts of space and 
time. These variations are described by the symmetry parameters in the 
transformations (3.2). 

The argument variations of the action thus spring from two sources, 
the path of motion described by the trajectories q{t) in a given reference 
frame and the variations of the origin and the translational and rotational 
orientation of the reference frame itself. According to the transformation 
law (3.2), the variations of the origin and the translational and rotational 
orientation of the reference frame are described by the variations of the 
symmetry parameters a, u, fl, and r. 

The total action variation SS therefore results from the path variations 
Sq [that portion which Eq. (3.10) describes] and the variations of the ten 
parameters of the symmetry group, <5a, Su , 5fl, and St. Their coefficients 
then each vanish for a condition of least action. These coefficients, which 
are also known as the functional derivatives of the action with respect to 
the symmetry parameters, are developed in Note 1 where the variational 
principle is discussed in detail. They turn out to be the ten fundamental 
mechanical quantities P, N , J, and H. 

The resulting action variation contains that given by path variations in 
Eq. (3.10) which we shall label (5A pa th as well as the contributions from the 
variations of the symmetry group parameters which we label 5A sym metries ; 

SS — (lA'p;ith T ^Asymmetries* 

These two components of the action variation are 



(5Ap a th — 




( d_ (TL 
\dt dqi 




(3.11) 



S Asymmetries = — A P • Sa — AN ■ Su — A J ■ Sfl — A HSt. 



In the second of Eqs. (3.11), A indicates the difference of values between 
end points of the path of any mechanical quantity before which it stands. 
For example, AP = P 2 — Pi. 

Least action leads to two kinds of conditions: equations of motion cor- 
responding to the variations of the path proportional to Sq in the first of 
Eqs. (3.11) and conservation laws corresponding to the symmetry param- 
eter variations Sa, Su, Sfl, St which appear in the second. 

The principle of least action requires that SS vanish. A vanishing action 
variation with respect to the variation of the trajectories along the path Sq 
in the first of Eqs. (3.11) leads to the equations of motion of the particles, 



d_ dL_ 
dt dqi 



^=0 

dq t ’ 



* = 1 , 2 ,..., 



s. 



(3.12) 
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These are the Euler-Lagrange equations of the variational calculus, partial 
differential equations which functions that minimize a functional integral 
must satisfy. For the Lagrangian of motion they are none other than New- 
ton’s second law of motion; for since the momentum is pi = dL/dqi and 
the force is /,; = dL/dqi , Eq. (3.12) is actually 



f = f‘- < 313 > 

The action principle for the paths of motion can be thought of as a 
principle that establishes a particular relationship between the Lagrangian 
partial derivatives along this path. The derivatives appearing in the total 
differential of the Lagrangian L = L(q, q), 



1T dL , dL,. 
dL = — — dqi + -rrrdqi, 

OQi OQi 



in the action principle are not independent. Their relationship to one an- 
other is fixed by the equations of motion 



Pi = dL/dqi, pi = dL/dqi. 



The differential of the Lagrangian which satisfies the action principle is 
therefore completely expressible in terms of p and p, 

dL = pidqi + Pidqi. (3-14) 

A vanishing action variation with respect to the Galilean symmetry pa- 
rameter variations in the second of Eqs. (3.11) generates conservation laws, 
one for each symmetry parameter variation: 



A P = 0, AN = 0, AJ = 0, AH = 0. 



These laws are called conservation laws because the quantity governed by 
each conservation law is the same at both the beginning and end of the 
path. Since the path end points are arbitrary, the quantity is conserved 
during the course of motion between any two points. Such a quantity is an 
invariant. It never changes as the particles sweep along their paths. 

It is interesting to observe in the second of Es. (3.11) that the variation of 
the action wrought by the symmetries consists of products of the symmetry 
parameter variations and their corresponding invariants. The product of 
a symmetry parameter and its invariant is an “action.” Corresponding to 
the variations of the origin 5a is the total linear momentum, the invariant 
P = Pa- Similarly, the translational variations of velocity space 5u 
produce the mass-center invariant N = Pt — mX. Space rotations 5ft 
yield the total angular momentum, the invariant J = x Pa- Time 

translations St lead to the invariant H = pi/g — L, the total energy. 
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Pouring out of the condition of least action come Newton’s law of motion 
and the ten invariants of total energy, linear momentum, mass-center, and 
angular momentum. Once established by initial conditions the total linear 
momentum, mass-center, angular momentum, and energy never change. 
As a flurry of other mechanical quantities dance in space and time, the 
invariants remain forever fixed. These ten invariants are always known for 
any mechanical system no matter how many particles compose it or how 
complex the force laws. 

Conservation laws are a manifestation of the symmetries of space and 
time which the action and the Lagrangian possess. When these symme- 
tries are subjected to the principle of least action, a corresponding set of 
invariants emerge, each invariant linked to an underlying symmetry. Cor- 
responding to each parameter carrying a given symmetry is a conservation 
law yielding the invariant appropriate to that symmetry. This profound 
connection between symmetries and invariants which is forged in the prin- 
ciple of least action was first clearly portrayed by Amalie Emmy Noether* 
in the first part of this century (Note 1). 

The emergence of invariants out of symmetry is one of the generous reve- 
lations of the physical world, a high moment in physics, and a mathematical 
discovery of rare beauty in its own right. The discovery of Emmy Noether 
shows us that nature manifests the symmetries of the world through invari- 
ants. In so doing it provides us the most powerful insight ever to come into 
physics. The story of the physical world has largely been woven about this 
many-splendored theme. 



The Hamiltonian Heritage 

The Lagrangian L = T — V is the centerpiece of the action principle; 
but it is not an invariant. However, it has a near image, the total energy 
H = T + V, which is the leading invariant of mechanics and to which it is 
related by 

H = Pifi - L. 

It is possible to formulate the action principle with the energy rather than 
the Lagrangian. When one does so, the law of motion takes a striking 
form first exhibited by William Rowan Hamilton. The total energy as the 
centerpiece of the action principle has come to be known as the Hamiltonian. 

The action principle may be formulated with the Hamiltonian by ex- 
pressing S = J Ldt in terms of H rather than L: 

6S = 6 f (pifi — H) dt = 0. 

Jt i 



* See “Emily Noether” by C. H. Kimberling in Am. Math. Monthly 79, 136, 1972. 
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The equations of motion in the Hamiltonian formulation of the action prin- 
ciple may be found by eliminating the Lagrangian in favor of the Hamilto- 
nian. This is most simply done by exploiting the fact that the equations 
of motion are directly linked to the partial derivatives of the Lagrangian 
as in Eq. (3.14). The same is true for the Hamiltonian. The equations of 
motion in Hamiltonian format are directly linked to the partial derivatives 
of the Hamiltonian. To find them, take the differential of the Hamiltonian- 
Lagrangian relationship, 



dH = pidqi + qidpi — dL. 

The differential of the Lagrangian may be eliminated using Eq. (3.14) show- 
ing that the differential of the Hamiltonian has the austere, antisymmetric 
form 

dH = q-idpi - Pidq.i. 

The equations of motion may therefore be cast in the elegant form of the 
Hamilton equations: 

4i = dH/dpi, pi = - dH/dqi . (3.15) 

If the Lagrangian and the Hamiltonian are the genes of motion, the 
Hamilton equations reflect their double-stranded structure. The coordi- 
nates are linked in pairs; and this pairing is an intertwined, antisymmetric, 
double strand of derivatives of the Hamiltonian with respect to positions 
and momenta. 

Antisymmetric pairing of coordinates is a reflection of deep structure 
in mechanics induced by the action principle called symplectic structure 
after 7 tA enrog, the Greek word for twined or braided. Symplectic structure 
describes the unique way in which the action principle weaves the motion 
of the world as a many-plaitecl twining of positions and momenta. 

A state of motion is fixed by a twofold set of coordinates. In the La- 
grangian formulation of the action principle these coordinates are positions 
and velocities. One sees that in the Hamiltonian formulation these coordi- 
nates are positions and momenta. The Lagrangian and the Hamiltonian are 
equally valid carriers of the genes of motion, the Lagrangian L = L(q,q) 
coded in a position- velocity alphabet and the Hamiltonian H = H{q 1 p) 
in a position-momentum alphabet, with the translation from one to the 
other effected by p, = dL/dqi and q t = dH/dpi. The paired set of coordi- 
nates satisfying Hamilton’s equations are called canonical coordinates and 
Hamilton’s equations are called the canonical equations of motion. 

It is important to observe that the Hamilton equations are not themselves 
the trajectories of the particles. At the center of the law of motion one does 
not find trajectories but rather the Hamiltonian genes and the developmen- 
tal machinery of the Hamilton equations. The Hamiltonian contains not 
trajectories but a mathematical code of particle positions and momenta 
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from which trajectories may be constructed. The code must be read into 
the appropriate instructions from which the trajectories may be built. This 
is the function of the Hamilton equations. They read the Hamiltonian into 
a set of differential equations which conform to the law of motion. But one 
will not have the trajectories themselves until these differential equations 
are integrated. Integration is the developmental process of motion. The 
integrals of the Hamilton equations are the final expression of the trajec- 
tories encoded in the Hamiltonian — the fully formed motion in space and 
time. 

The only general method available for constructing the trajectories from 
the Hamilton equations is that of numerical integration, a method which 
gives up expression in mathematical functions for expression in streams of 
data. This method is of practical importance; and it also shows explicitly 
how states of motion evolve into new states under the action of the Hamilton 
equations. 

One numerically constructs the integrals of the Hamilton equations over 
a sequence of small steps in time At by approximating the time derivatives 
as finite differences, for example as 

. _ q(t. + At) - q(t) 

9 ~ At 

with a similar expression for p{f). The Hamiltonian H(q,p) and its deriva- 
tives may be evaluated with the state q(t),p(t). Hamilton’s equations may 
then be used to build the trajectories of q and p in incremental fashion, 



dH 

qi(t + At) = qi(t) + — At, 
dpi 



dH 

Pi{t + At) = pi(t) - — At. 

dQi 



An initial state sets the ensemble into motion. When acted upon by the 
Hamilton equations, each state q(t),p(t ) creates a succeeding state q(t+At), 
p(t+At). Each succeeding state serves as the initial state for the next state. 
States follow states as Hamilton’s equations develop, step by step, the entire 
trajectories enfolded in the genetic code of the Hamiltonian. 

Notice that the action itself makes no appearance in the Hamilton equa- 
tions; the trajectories may be found without ever dealing directly with it. 
The coordinates (q,p) and the Hamiltonian H(q,p), not the action, are 
the players in the Hamilton equations. The action, however, is very much 
present behind the scene. And as we further probe the nature of motion 
we shall find the action making a dramatic reappearance on center stage. 

Although any mechanical quantity f(q,p,t) may be obtained from the 
canonical coordinates, its time rate of change along the path of motion is 
of interest in its own right. The time derivative along the path of motion 
or, as it is sometimes called, the total time derivative of any mechanical 
quantity is given by 
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The rate of change along the path of motion first recognizes that a quantity 
may be an explicit function of time t and hence it possesses a time-rate of 
change described by d/dt. But more pointedly, the total rate of change in- 
corporates the change in a quantity brought about by its being carried along 
the path of motion as a function of q and p. All mechanical quantities — even 
those not explicit functions of time — possess this rate of change. 

Using Hamilton’s equations (3.15) to eliminate q and p, the total rate of 
change may be compactly expressed as 

d f <9/ 

dt = m +llH] ’ (3 ' 16) 

where [/, g\ is the Poisson bracket of any two functions of the motion: 



/ df_ dgp _ dg df \ 

\ % dpt dqi dpi ) ' 



(3.17) 



The Poisson bracket is built from a pair of derivatives with respect to 
the position-momentum pair; and this pairing is again an antisymmetric 
manifestation of symplectic structure. The Poisson bracket of two functions 
/ and g takes the character of the product fg of these functions. If / and 
g are both scalars, their Poisson bracket is a scalar. If only one of them is 
a vector, their bracket is a vector. If both are vectors, their bracket is the 
product of two vectors known as a second-order tensor (a vector being a 
first-order tensor). 

The Poisson bracket is a mathematical form of prodigious power; and its 
presence is ubiquitous throughout mechanics. This is because the complex 
dynamics of the world are ultimately woven upon two sets of coordinates. 
The Poisson bracket is a composition of two functions. It builds up complex 
dynamical quantities from a more primitive pair in accord with the law of 
motion. We shall find that virtually the full content of the law of motion 
is contained within the Poisson bracket. As the first of many illustrations, 
the Hamilton equations of motion themselves acquire a simple and elegant 
expression in Poisson brackets as 



4i = [Qi,H], Pi = \pi, H\. 

The most elementary Poisson brackets are those of the position and 
momentum vectors themselves. They are found to vanish for each vector 
separately. But the Poisson bracket of the fundamental pair of mechanical 
variables with one another is unity: 

[Qi, Qj] = 0, ki,Pj) = dip \ puPj } = 0, 

where Sp = 0 for i j and Sp = 1 for i = j. 

While the Poisson bracket of different components of linear momentum 
always vanishes, the Poisson brackets of different components of angular 
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momentum do not. Rather, angular momentum cyclically reproduces itself 
under the action of the Poisson bracket. The Poisson brackets of the angular 
momentum of a single particle are readily calculated to be 



[Jl,J 2 ] ~ J3, [J2, J3] — J 1 ) [J3, Jl] ~ J2, 

and the Poisson bracket of any two identical components vanishes. The to- 
tal angular momentum of the ensemble also has the same Poisson bracket 
behavior. This result may be naturally expressed through use of the alter- 
nating tensor as 

[./j. Jj = CijkJk- (3.18) 

Equation (3.18) also shows that the angular momentum is perpendicular to 
its own Poisson bracket, J • [J, Jj] = Jj] = Cjjk •/-, Jk = 0. As a result, 
the magnitude of the angular momentum and any one of its components 
has a vanishing Poisson bracket, 

[J 2 , Jj] = 0. (3.19) 

(Since the Poisson bracket is based on derivatives, it obeys the product 
rule: [J 2 ,Jj] = Jj] = 2J i [J il Jj].) 

The Poisson bracket of functions of the motion with the angular momen- 
tum is the most intriguing of the operations induced by Poisson brackets. 
A direct calculation shows that the rectangular position and momentum 
coordinates, like the angular momentum itself, are cyclically reproduced by 
Poisson brackets with the angular momentum, 

[X-l , Jj] — eijkXki [Pi ; J j ] ^ ijkPk • (3.20) 

As a result, the scalar magnitudes x 2 , p 2 , and xp all have vanishing Poisson 
brackets with the angular momentum according to the same argument as 
that leading from Eq. (3.18) to Eq. (3.19). An important property of the 
angular momentum Poisson bracket is therefore apparent: it vanishes for 
all scalar functions. This follows from the fact that a scalar function of two 
vectors such as x and p can only depend upon scalars formed from the 
vectors in the combinations x 2 , p 2 , and x p. 

The results (3.19) and (3.20) may be used to reveal an even more re- 
markable property of the angular momentum. Angular momentum cycli- 
cally reproduces not only itself but any vector function F = F(x, p) of the 
positions and momenta of the motion: 

[J'i • Jj] ^> jk Tp - . 

The Poisson bracket of any scalar with the angular momentum is instantly 
known: it vanishes. The Poisson bracket of any vector function of mo- 
tion with the angular momentum is also instantly known, no matter how 
complex the vector: it is the cyclic-reproduction of itself. 
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This magical property arises because a vector function F(x, p) can only 
be formed from the vectors x, p, and J = x x p in the form 

F(x, p) = ax + bp + cx x p, 

where a, b , and c are scalar functions of x and p. The Poisson brackets 
of the scalar functions with the angular momentum vanish while x, p, 
and J = x x p cyclically reproduce themselves — and the vector function 
F(x,p). 

The cyclic Poisson bracket, illustrated here by the angular momentum, 
is a deep and recurring presence in mechanics. Cyclic structure is a mani- 
festation of rotational symmetry. Since angular momentum is the invariant 
arising from the rotational symmetry of space, it is not surprising that 
the angular momentum possesses a cyclic Poisson bracket. But other me- 
chanical quantities will be found that also possess cyclic Poisson brackets 
with the structure of Eqs. (3.18) and (3.19) revealing that rotations are 
far-reaching symmetries of the physical world embracing more than the an- 
gular momentum. Indeed, we shall find that the symmetries of two-body 
motion with inverse-square force are rotational. 



The Mathematical World of Motioti 

One naturally thinks of motion in the configuration space of the particles. 
But nature tells us that a state of motion is described by not one but 
a twofold set of coordinates. The manifold of momentum coordinates is 
the other half of the complete space of motion. For a single particle, the 
space of motion is a six-dimensional manifold consisting of its three position 
coordinates and its three momentum coordinates. For a constellation of n 
particles, the state of motion is specified by a point in a 2s-dimensional 
manifold whose coordinates are the s positions and the s momenta of the 
particles where s = 3n. The space of motion is therefore always an even- 
dimensional manifold and has come to be known as the phase space. Phase 
space is the mathematical world in which the motion of the world is imaged. 

Motion in configuration space is a projection of the more comprehensive 
tableau acted out in phase space. Motion fills phase space with phase tra- 
jectories q{t), pit). Phase trajectories, when projected onto configuration 
space, yield particle trajectories. (One thinks of shining a light on a string 
representing a phase trajectory looping through phase space and project- 
ing its shadow on a configuration space plane.) This leads to an interesting 
way in which the structure of a trajectory in configuration space is created. 
For example, a phase trajectory which proceeds in a single direction may 
project onto configuration space with reversals as shown in Fig. 3-4 (a). 
Phase trajectories free of intersections may project into configuration space 
with intersections (b). Other more complex folds of the phase trajectory 
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(a) 




Figure 3-j • Trajectories in Configuration Space as Projections from Phase Space. A 
phase trajectory without reversals may possess reversals when projected into configura- 
tion space (a). A smooth, non-intersecting trajectory in phase space may project with 
intersections (b) and singularities such as cusps (c). 



project onto configuration space as trajectories with singular points, such 
as cusps, as shown in Fig. 3-4 (c). 

Geometrical figures in phase space are not just transformed images of 
the positions of particles in configuration space. Since half the coordinates 
of phase space are momenta, the phase trajectories are composed of both 
positions and momenta (or any appropriate conjugate pair of canonical 
coordinates). Curves, angles, surfaces, and volumes are built from the 
point sets of both position and momentum. The geometrical objects in 
phase space are truly dynamical rather than simply configurational. They 
embody momentum in their structure as well as the shapes created by the 
particle positions. 

The essence of configuration space is its Euclidean structure. The essence 
of phase space is its symplectic structure. (The Euclidean and symplectic 
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properties of a space are independent.) Phase space is inherently symplec- 
tic in a manner we shall define more precisely, but it is rarely Euclidean. 
Although each position and momentum vector creates a Euclidean subspace 
with its three components, the metric relationships between the position 
and momentum components of a single particle as well as those between 
the positions or momenta of different particles are not in general Euclidean. 
The only generic structure of phase space is its symplectic structure. 

It is useful to distinguish the two sets of canonical coordinates q and p 
in separate vectors for many purposes. But the unique properties of phase 
space become apparent when one makes the final leap to a single 2s- vector, 
the state vector of phase space, 

£ = (q,p)- 

The first s components of this vector are the position coordinates of all the 
particles and the following s components are their momentum components 
taken in the same order. A state of motion of the complete constellation 
of particles is specified by the phase vector £. Corresponding to the phase 
vector is the phase velocity, 

£ = ( q,p )• 

When one describes motion with the phase vector £, the symplectic struc- 
ture is carried by a 2s x 2s antisymmetric matrix known as the symplectic, 




with / the identity or unit matrix (of dimension s in the above expres- 
sion). The matrix J is the signature of symplectic space with the notable 
properties 

J=-f = -J- 1 , 

where J f is the transpose of J obtained by interchanging its rows and 
columns. The symplectic matrix also has a square which is the negative 
2s x 2s unit matrix 




The symplectic J has the defining property of inducing a vanishing scalar 
product on any phase space vector*: 

Vt = 0. (3.21) 



* It is customary to omit the subscripts in simple products of vectors and matrices. 
With Einstein’s notation explicitly written out, Eq. (3.21) reads &Jij = 0. 
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A symplectic space is a space of vectors that satisfy Eq. (3.21). This 
condition parallels the condition for Euclidean space in which the identity 
/ plays the role of the symplectic J: 

xlx — x 2 . 



In both cases, a quadratic form (a form involving sums of second-order 
products of the coordinate vector) is an invariant of the space. 

The Poisson bracket acquires a natural expression in symplectic space. 
It is the symplectic product of the derivatives of functions /(£) and g(f), 



r r 1 _ df_ ,dg 

^ d^ J dC 



Because the fundamental property of symplectic space is r/J rj = 0 for any 
vector 77 , one immediately sees that the Poisson bracket of any function 
with itself vanishes since such a form is just the defining form of symplectic 
space. 

This general form shows that the Poisson bracket of the phase vector £ 
itself with any function /(£) is 

K,/]- j%. 

The Hamilton equations involve precisely this form. The time derivatives of 
the coordinates that constitute the phase vector £ = ( q,p ) are the Poisson 
bracket of the phase vector with the Hamiltonian: 

f) M 

i = [Z,H] = J—. (3.22) 



The Poisson Bracket: Motion as Flow 

The classical motion of the world is mathematically captured in particle- 
points. It is interesting that the motion of these points creates a mathemat- 
ical image in phase space that is the flow of a fluid. The motion of particles 
sets the manifold of points in phase space flowing. This is illustrated by 
the Hamilton equations for which all points q(t), p(t ) are swept to new lo- 
cations q(t + At), p(t + At) in the time interval At. The cumulative tracks 
of these motions are the streamlines of the particle trajectories. 

The Poisson bracket plays the central role in the flow of mechanical 
quantities. If one traces a region through phase space in which a function 
E(£) such as the energy, angular momentum, or any other is constant, one 
traces out a hypersurface, a submanifold one dimension less than the full 
manifold of phase space as shown in Fig. 3-5. Submanifolds upon which a 
function is constant are called the level surfaces of the function. 
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Figure 3-5. Mechanical Quantities as Level Surfaces in Phase Space. A mechanical 
quantity F(£) exists in phase space as interleaved level surfaces one less dimension than 
the dimension of phase space. 



A level surface has at each point a vector normal to the surface given 
by dF/df as shown in Fig. 3-6. Such a surface also has vectors orthogo- 
nal to the normal vector that lie wholly within it. These are its tangent 
vectors. How are these tangent vectors determined? Remarkably, they are 
determined by the Poisson bracket. The Poisson bracket [£, F] of the phase 
vector £ itself with any function F(£) turns out to be a vector tangent to 
the level surface of F as indicated in Fig. 3-6. 

The Poisson bracket [£, F\ can be seen to lie in the level surface of F by 
projecting the normal vector dF/df on the vector [£, F] (i.e. , by taking the 
scalar product of the two vectors) and finding that it vanishes, 



OF 

~df 



dF dF 

W=et J et s0 - 



The Poisson bracket can be thought of as a machine that takes as inputs 
the phase vector £ and the function F(£) and produces as output the vector 
[£, F] which is tangent to the level surface of F. 

The fluid nature of the tangent vectors on the level surface can now be 
portrayed. At each point one has a local tangent vector [£, F}. One may 
think of this tangent vector as the “velocity” df/dr of a point of fluid lying 
on the surface, 

§ = (3-23) 

where r is a parameter analogous to time. Beginning at a point, one may 
trace out a curve £(r) as the parameter r varies which at every point has 
a tangent equal to [f,F]. Such a curve is the solution to the differential 
equations (3.23) for a space curve lying in phase space. These curves are 
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Figure 3-6. Normal, Poisson bracket , and Tangent Vectors. The normal vector to the 
level surface F(f) = const is dF/df. The Poisson bracket of the phase vector £ with the 
function F(£) is a tangent vector [£, F] of the level surface F(£) = const. 




Figure 3-1. Flow Streamlines on the Level Surface of a Mechanical Quantity. The 
Poisson bracket [£, F] generates the streamlines lying upon the level surfaces of the 
mechanical quantity F(f). 



the streamlines of the flow as exhibited in Fig. 3-7. Since the “velocity” 
vectors [£, F } are always tangent to the surface, the streamlines never escape 
the surface but lie wholly within it. The Poisson bracket thus shows that 
mechanical quantities F(£) exist in phase space as interleaved sheets, each 
sheet covered with a streaming flow of points known as the flow of that 
mechanical quantity. 

There is one mechanical quantity for which the streamline parameter r 
is indeed the time t and the Poisson bracket is indeed the velocity field £ of 
the motion. That function is the Hamiltonian; and the Hamiltonian flow 
is the preeminent flow in phase space. Hamilton’s equations induce a flow 
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with a velocity equal to the Hamiltonian flow velocity (3.22) at each point, 
f = [£, if], and this flow is that of the trajectories of the particles in phase 
space. 

Let us now find the significance of the Poisson bracket of any two quan- 
tities F(f) and G(£). Their Poisson bracket may be expressed as 



t ,G ^ df J df df ^ 



dG 

~di 



K.n 



One now finds a more general interpretation of the Poisson bracket: the 
Poisson bracket of any two functions is the projection of the normals of 
the level surfaces of the one upon the tangents of the level surfaces of the 
other. Since the projections of the normals upon the tangents of the same 
surface vanish, one obtains a natural interpretation of the vanishing Poisson 
bracket of the same function. 

The Poisson bracket shows how mechanical quantities intersect in phase 
space. If [F, G] 7 ^ 0, then the flow of F does not stay on level surfaces of 
G but cuts across them. Thus, G is not constant on the flow of F. The 
same is true of the flow of G with respect to the level surfaces of F. But 
if [F, G] = 0, then the flow of F stays not only on its own level surface but 
also on the level surface of G. The level surfaces of these two quantities 
have become one. 

A simple illustration shows schematically how this can occur. Suppose 
the quantities F and G each have one-dimensional level surfaces which are 
concentric circles. Each of the flows is a closed circular streamline on its 
respective circle. In general, these families of circles will intersect as shown 
in Fig. 3-8 (a) and the functions F and G will not be constant on each 
other’s flows. But if [F, G] = 0, then these two sets of level surfaces are 
merged into the torus as shown in Fig. 3-8 (b) and both flows lie upon the 
merged surface of F and G. 

A vanishing Poisson bracket signals a significant simplification in the 
flow. Although mechanical quantities generally fill phase space with level 
surfaces which entangle and intersect in complex ways, when the Poisson 
bracket of two quantities vanishes, the level surfaces of these two quantities 
become one common surface to which both flows are confined. A partial dis- 
entangling of the flow has occurred. These two mechanical quantities now 
possess a simple relationship. Their flows are bound to the same sheet, 
however convoluted the lay of that sheet in phase space and however com- 
plicated its relationship to the flows lying on the sheets of other mechanical 
quantities. 

In summary, every mechanical quantity F(£) has an image in phase space 
as sheets of level surfaces filled with streamlines generated by [£, F] called 
its flow. The flow of a mechanical quantity necessarily lies upon its own 
level surfaces; but it need not lie upon the level surfaces of another quantity. 
The Poisson bracket of any pair of mechanical quantities [F, G] describes 
the intersection of their flows. If the Poisson bracket of any two quantities 
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Figure 3-8. Two Flows with Vanishing Poisson Brackets Share a Common Level Surface. 
Each flow of the mechanical quantities F and G lies upon a level surface which is a circle. 
In general, [F, G] ^ 0 and the flows cut across one another in phase space (a); but if 
[ F , G\ = 0, the flows no longer cross one another but arrange themselves upon a common 
level surface which is a torus (b). 

vanishes, their level surfaces are common and the mechanical quantities 
they represent are constant along one another’s flows. Phase space has a 
correspondingly simple organization with respect to these quantities. A 
common surface is aligned with both flows. 



Poincare Recurrence 



Little is known about the general motion of many bodies in phase space; 
but one powerful theorem does exist and it has important implications: the 
phase flow is incompressible. An element of phase space containing a fixed 
number of points may be endlessly stretched, filamented, and dispersed by 
the flow. But it cannot be compressed or expanded. The volume of the 
same cluster of points, however distorted and smeared over phase space by 
the flow, is always the same as shown in Fig. 3-9. 

The deformation of a fluid element is fully described by the deformation 
of the flow field in which it resides. An element of fluid must grow in volume 
if it flows through a region in which the streamlines are diverging as shown 
in Fig. 3-10. The divergence of the streamlines is described by gradients of 
the flow in the direction of the flow. For example, for the Hamiltonian flow 
in rectangular coordinates one has div(£) = dfi/dfi, and it is this quantity 
alone that controls the rate of volume change of a fluid element. If V is the 
volume of a fluid element, its rate of change is 



1 dV 
V dt 



div(0 



dji 
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Figure 3-9. Volume Preservation in Phase Space. An element of phase space containing 
a fixed number of points at time t = 0 may be smeared throughout phase space by the 
flow for t > 0; but it cannot be compressed or expanded. The volume of the element, 
however dispersed, is always the same. 



Phase flows are incompressible ( dV/dt = 0) because their velocity fields 
are inherently divergence free. Again using the Hamiltonian flow as the 
illustration, one finds from Eq. (3.22) 



div(0 



d_ d_ 



H = 0. 



Because the symplectic scalar product of any vector (in this case d/df) 
with itself vanishes, the phase velocity field is a divergence-free field and 
the flow is incompressible. The Hamiltonian flow illustrates the point; but 
all flows [£, F } are incompressible and preserve volume in phase space. 

The incompressibility of the phase flow is the basis of the only generic 
theorems known for the general motion of many bodies. The most impor- 
tant of these concerns the recurrence of states of motions that are bound 
in a finite region of phase space and is due to Poincare. A cluster of points 
singled out at any instant of time subsequently courses through phase space 
as tubes of flow sweeping out volume. Since the volume of the space is fi- 
nite and incompressible but the time for sweeping is unbounded, the flow 
must eventually sweep out a volume it has already once swept through. 
The tubes of flow therefore intersect themselves and the flow returns to the 
vicinity of phase points through which it has already passed. States of a 
Hamiltonian flow in a compact region therefore recur over the course of the 
motion. 

The Poincare recurrence theorem says nothing about the time required 
for a particular state to recur; it may be absurdly long compared to other 
important time scales in the motion. States cannot always be expected to 
recur in meaningful finite times. 



Invariants and the Elementary Flow 

The motion of the world is imaged in phase space as a flow; but this flow 
only becomes known to us when the integrals of the Hamilton equations 
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become known. Remarkably, the Hamilton equations, even those of the 
most complex motions, admit elementary integrals; and we are now going 
to find them. These integrals generate the elementary flow from which the 
flows of all possible motions may be built. 




Figure 3-10. Flow Divergence and Volume Change. The divergence or convergence of 
streamlines corresponds to the expansion or compression of the volume of fluid elements. 

There is an analogy here with complex matter and elementary particles. 
As all matter is built up from elementary particles, all motion may be built 
up from elementary flows. As elementary particles possess invariant prop- 
erties such as charge and mass, elementary flows possesses the invariant 
properties of a motion. As the elementary particle invariants are carried 
over to the complex matter which they build up, so too are the elementary 
flow invariants carried over to the complex flows which they build. And just 
as matter may be a bewilderingly complex assemblage of elementary par- 
ticles, so too may complex motions be an equally bewildering composition 
of elementary flows. 

To discover the elementary flow and the invariants of motion with which 
they are endowed, one must first appreciate the power in the generality of 
the canonical equations. Because the motion of the world stands outside 
the particular quantities we use to apprehend it, there is a corresponding 
freedom in the choice of mechanical quantities in which we may cast our 
description. The law of motion as it is expressed in the canonical equa- 
tions reflects this freedom. The mathematical forms of the action principle, 
the Hamilton equations, and the Poisson bracket are independent of the 
particular coordinates in which they are expressed. 

Beginning with the most primitive physical quantities — position and 
velocity — we have worked our way up to the Hamiltonian summit to dis- 
cover that the primitive coordinates that brought us here are not unique. 
The law of motion allows other coordinate choices. Quantities built from 
the positions and velocity such as the momentum, angular momentum, 
energy, and other mechanical quantities we shall yet encounter (and, one 
might add, from which position and velocity may be built) serve with equal 
validity as canonical coordinates. What is required is that for motion with 
s degrees of freedom one has 2s canonical coordinates which form s conju- 
gate pairs. It is not the coordinate strands themselves but their conjugate 
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pattern which is universally woven into the fabric of motion by the action 
principle. 

If the law of motion is to have the same form in all canonical coordinates, 
these coordinates must transform into one another in a particular way. Any 
set of canonical coordinates is related to another set by transformations 
that preserve the action principle, the Hamilton equations, and the Pois- 
son bracket. These transformations are naturally enough called canonical 
transformations and were worked out by the versatile nineteenth-century 
mathematician C. G. J. Jacobi. 

Consider another possible set of canonical coordinates ( Q,P ) in which 
one chooses to cast the same motion. These coordinates must be 2s in 
number divided between a “position-like” set, Q = (Qi, Q 2 , • ■ • , Q s ), and a 
conjugate “momentum-like” set, P = (Pl, P 2 , . . . , P s ). The notation (Q, P) 
is misleading to the extent that it suggests that Q and P are necessarily an 
actual position and momentum. The coordinates Q and P need not at all 
be position or momentum. They may represent any mechanical quantity. 
Moreover, there is no significance to “position-like” and “momentum- like.” 
It is the conjugal relationship between the coordinate pair rather than the 
individual member that is significant to the law of motion. This is easily 
seen by interchanging the position-like and momentum-like coordinates: 



Qi 



Pi, 



Pi 



Qi- 



Hamilton’s equations 

Qi = dH/dP u Pi = —dH/dQi, 



(3.24) 



maintain the same form under such a change. 

The connection between a set of canonical coordinates ( q , p) with Hamil- 
tonian H — H(q,p) and another set of canonical coordinates (Q,P) with 
Hamiltonian H' = iP(Q,P) is determined by requiring that they satisfy 
the same law of motion. This means that the action variations of both sets 
must be identical: 

SS = 6 I ( Pi dq t — H dt) = S j (Pi dQi — H' dt) . (3.25) 

J t\ Jt 1 



It is obvious that the variational problems appearing on each side of the 
right equality of this equation will lead to identical Hamiltonian and Poisson 
bracket structures, each expressed in its own coordinate language, that of 
Eqs. (3.15) and (3.17) for the (q,p) language, and Eqs. (3.24) for the (Q, P) 
language with Poisson bracket, 



[F,G\ 



^ dF dG 



dG dF' 



dQi dPi dQi dPi 
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where F(Q,P) and G(Q,P) are any functions of the new coordinates. In 
particular, all canonical coordinates satisfy 

[Qi,Qj]=0, [Qi,Pj] = S iji [Pi,Pj}= 0. (3.26) 

These observations in themselves, however, do not tell how the coor- 
dinates transform into one another. That connection is provided by the 
insight that the two integrals that appear under the variation in Eq. (3.25) 
need not be identical. It is their variations which must be identical. The 
integrals may differ by any function F that has a vanishing variation and 
still lead to the same variational problem for (q,p) and ( Q,P ). The dif- 
ference between the integrands in Eq. (3.25) must therefore be the total 
differential of a function F, 

dF = Pi dq t - Pi dQi -{H- H') dt. (3.27) 

The requirement that the two variational problems be the same intro- 
duces a new function into the picture. This function, known as the gener- 
ating function of the transformation, is an arbitrary function of one of the 
new and one of the old canonical coordinates and time. In the form (3.27) 
it is the function F = F(q,Q,t) and the relationship of the coordinates to 
F may be read off Eq. (3.27) as 

Pi = dF/dqi, Pi = - dF/dQi , H - H' = -dF/dt. (3.28) 

This generating function depends upon the old and new position-like co- 
ordinates. But it is only required that the generating function be a total 
differential of pairs of the two sets of coordinates. Another possible gener- 
ating function can be formed from (3.27) as 

dG = d[F + PiQij = Pi dq t + Qi dPi -{H- H') dt. (3.29) 

In this case the generating function is G = G(q , P, t) and the coordinates 
are 



Pi = dG/dqi, Qi = dG/dPi, H - H' = -dG/dt. (3.30) 

The generating function G(q, P, t) depends upon the old position-like co- 
ordinates and the new momentum-like coordinates. It is obvious that two 
other generating functions can be formed for the other pairs of old and new 
positions and momenta. 

The generating function is the key actor in canonical transformations, 
transforming the state (q,p) into the state (Q, P). Two things are notewor- 
thy about canonical transformations. First, the generating function incor- 
porates one coordinate from the old pair and one from the new pair. It ties 
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the two states together with one coordinate from each. Second, the canon- 
ical transformation presents one of the new coordinates explicitly and one 
of them implicitly. For example, in the transformation of the state (q,p) to 
the state ( Q,P ) by the generating function G{q,P,t), the implicit-explicit 
presentation may be made graphic by writing the first two of Eqs. (3.30) as 

O A 

—G(q,P,t) = pi, Qi = —G(q,P,t). (3.31) 

The new coordinates Q{q,p) are given explicitly by the second of Eqs. (3.31) 
while their canonical mates P{q,p) are given implicitly by the first. 

As elementary illustrations, the generating function 

G{q , P, t ) = qiPi 

is the identity transformation 

Qi q% 7 Pi Pit P P> 

while the generating function 

P{q, Qi f ) — qi.Qi 



yields the inversion 



Qi Pi 5 Pi qi , P — P* 

The canonical structure of motion tells us that there are coordinates 
other than the primitive coordinates in which we may cast a motion which 
satisfy the same law of motion: they are governed by the Hamilton equa- 
tions and possess the same invariants. The nineteenth-century astronomer 
C. E. Delaunay saw the way to use this connection to good advantage. 
Since the only requirement of coordinates is that they be canonical, why 
not choose coordinates in which the integrals of the Hamilton equations 
Qi{t ), Pi{t) yield a flow of maximum simplicity, a simplicity not usually 
apparent in primitive coordinates? This flow is the elementary flow and 
occurs when one-half the coordinates, say the Pi, are selected as constants 
Pi = Ii, with Jj = 0. The other half, Qi = on, are the canonical mates of 
these constants. 

The Hamilton equations (3.24) now simplify dramatically, taking the 
form 

ai = dH'/dIi, h = -dH'/dati = 0. 

While the second Hamilton equation has been reduced to its maximum 
simplicity, the first has also become simple. This is because a condition on 
one of the canonical coordinates impacts the other due to their symplectic 
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union. In addition to showing that the coordinates / are constants, the sec- 
ond set of Hamilton equations shows that the Hamiltonian is independent of 
the a coordinates: H' = H'(I) only. The Hamiltonian derivative OH' /dli 
is likewise a function only of I which may be expressed as wpl) = dH' /dli. 
The first set of Hamilton equations thus has the simple integral by quadra- 
ture: 

= J(dH'/dIi) dt = Ui t + Pi, 

where j3 = (Pi, P 2 , ■ ■ ■ , Ps) are its integration constants. 

The Hamilton equations possess the elementary flow integrals 

on = wit + Pi, Ii = const. (3.32) 

The elementary flow has the property that it “flows” only along the a co- 
ordinates. Its phase velocity has no components in the invariant coordinate 
I directions since I = 0. The integrals of the elementary flow depend upon 
constants of integration that are the two sets of s quantities p and I. 

The elementary integrals of the Hamilton equations (3.32) may be gath- 
ered together as 2s-dimensional symplectic vectors that are the phase vec- 
tor, its phase velocity, and the integration constants, 

5= (a, I), n = (w,0), T = (P, I). 

The elementary flow (3.32) underlying all motions is 

^.(f) = fit T T 

and its phase velocity is 5 = O. The elementary flow consists of s symplectic 
pairs 5, = (on, If). These are the fundamental flows from which all motions 
may be constructed by canonical transformation from the elementary phase 
space 5 = (a, I) to the phase space £ = ( q,p ). 

The elementary flow rests upon the constants X = (/?, I) . These con- 
stants are not only invariant on the elementary flow; they are invariant on 
the flow in the primitive coordinate phase space ( q,p ) of the motion built 
from it. Quantities invariant on the flow in one set of coordinates are in- 
variant on the image of this flow in all other canonical coordinates. The 
constants X = (p, I) are therefore singularly important quantities: they 
are the 2s invariants of the motion. Invariants are significant because the 
elementary phase space coordinates 5 = (a, I) are not connected in any 
simple way with their image phase space coordinates £ = ( q,p ) (the canon- 
ical transformation that connects them is generally not simple). But the 
invariants X = (p, I) are the same in both phase spaces. They are the 
invariant link between the two spaces. 

An invariant X satisfies a conservation law, 



A(X) = 0, 
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like the Galilean invariants in Eqs. (3.11). An invariant also satisfies the 
condition that its rate of change along the flow, or its total time derivative 
given by Eq. (3.16), vanishes: 

§ = § + |J,ff]=0. <“ 3 > 

This condition can be seen to be the same as the conservation law A (I) = 0 
by considering two points on the flow separated by an infinitesimal time 
interval At. The difference in the quantity I between these two points is 
A (I) = (dT/dt)At. Since A (I) must vanish for an invariant and the time 
interval At need not vanish, one is led to the condition (3.33). 

It frequently occurs that an invariant is not an explicit function of time 
(< dl/dt = 0) but depends only upon the canonical coordinates T = X(q,p). 
Equation (3.33) for time-independent invariants thus reduces to the condi- 
tion 

[X,H\=0 

showing that time-independent invariants have level surfaces in phase space 
that are coincident with the level surfaces of the Hamiltonian. 

In addition to obeying a conservation law, invariants also possess canon- 
ical structure. One readily shows from Eqs. (3.32) and (3.26) that the 
invariant set X = (/?, I) is a canonical pair satisfying the Poisson brackets 

[&,&]= 0, \fii,Ij] = S ij , [Ii, Ij] = 0. (3.34) 

Invariants of the elementary flow, like the invariants of elementary particles, 
persist into the complex flows built from them. Ten of these 2s invariants 
are the Galilean invariants created by the fundamental symmetries of me- 
chanics and are known in all motions; but most of the invariants of general 
motion are unknown to us as functions of the primitive coordinates X(q,p). 

Explicit knowledge of a single invariant Ii(q,p) = const certainly allows 
one to algebraically remove one of the 2s canonical coordinates leaving 2s— 1 
integrals of the Hamilton equations to be found. Such a reduction applies 
to any system of differential equations; but the fact that the equations of 
mechanics are Hamiltonian gives mechanical invariants a greater power of 
reduction. Knowledge of one invariant produces two integrals of Hamilton’s 
equations. This is because one invariant not only provides the integral of 
one Hamilton equation algebraically; it reduces the Hamilton equation of 
its mate to quadrature providing its integral as well, as illustrated by the 
integrals (3.32). The power of one invariant to produce two Hamiltonian 
integrals is a unique and remarkable feature of the equations of motion. It 
sharply distinguishes Hamiltonian mechanics from other systems of differ- 
ential equations. 

Every invariant produces two Hamiltonian integrals. It is therefore suf- 
ficient to possess s invariants to find the complete 2s integrals of a Hamil- 
tonian system. But there is a catch. The invariants I are canonical coor- 
dinates; they have vanishing Poisson brackets (3.34) and share a common 
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manifold. It is this simplicity in the flow which reduces its Hamilton equa- 
tions to quadratures. The collection of s invariants must be of one canonical 
half (say the Is); one cannot mix the Is and /3s together. A number of inte- 
grals of a Hamiltonian system equal to 2m may be found if to ^ s invariants 
with collectively vanishing Poisson brackets, 

(7i, / 2 , . . . , / m ), [Ii,Ij\ = 0, 



are known. 



Action Redux 

Given the elementary flow of the Hamilton equations, one is naturally 
interested in canonically transforming it into the flow in the primitive coor- 
dinates in which one observes the motion. What is the generating function 
that accomplishes this task? It is the action itself; and the existence of 
invariants brings us full circle to the action. We began our quest for the 
trajectories by minimizing the action. This procedure led to equations of 
motion and canonical coordinates while the action receded into the back- 
ground. The invariants and the elementary flow now reunite the action S 
with the canonical coordinates (q,p) in the following way. 

The momenta and energy are directly obtainable from the action. They 
are its derivatives with respect to the configuration coordinates and time. 
This follows from the fact that the total time derivative of the action is 
the Lagrangian, dS/dt = L. Since the energy H and the Lagrangian L are 
related as L = —H + piqi, the action time derivative is dS/dt = —H + Pi4i 
and the differential of the action is 



dS = —H dt + Pi dqt . (3.35) 

Equation (3.35) shows that the action is a function of the configuration 
coordinates and time S = S(q,t): 

dS , dS , 

dS = — dt+ —— dq z . (3.36) 

ot oqi 

Comparison of Eq. (3.35) with Eq. (3.36) reveals that the energy and mo- 
menta are 

H = -OS/dt, pi = dS/dq i. (3.37) 

In this form the law of motion may be regarded as a first-order partial 
differential equation for the action S itself. For since H = H(q, p) and the 
momenta are given by p t = dS/dqt by the second of Eqs. (3.37), the first 
becomes 



dS/dt + H{q , dS/dq) = 0. 



(3.38) 
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Equation (3.38) is the Hamilton- Jacobi equation governing the action S. Its 
explicit form for a system of particles in rectangular coordinates q = (x Q ), 
p = ( p a ) with p a = V a S is 

dS/dt + 53(l/2m a )V Q S- V »S + V(x a ) = 0. 

a 

The action S(q, t) is the solution to the Hamilton- Jacobi equation (3.38), 
a first-order partial differential equation in the s position coordinates q and 
the time t. The solution of such a partial differential depends upon s + 1 
constants of integration. One of these constants is purely additive; that is, 
if S(q, t) is a solution of the Hamilton-Jacobi equation, so too is S(q, t)+A 
where A is the additive constant. 

The s remaining constants clearly have to be invariants. They may be 
selected as one-half of the invariant set, say the invariants I. Moreover, 
since the total energy is an invariant, say H = £ = —dS/dt, the time 
dependence of the action is simple; it is —St. The action must therefore be 
of the form 

S(q,I,t) = -£t+S 0 (q,I), (3.39) 

where Sa(q, /) is the time-independent part of the action. It is important 
to note that the energy £ which appears in the first term on the right of 
Eq. (3.39) is one of the invariants I = (7i, 7 2 , • ■ • , Is)- 

The generating function of the elementary flow turns out to be the time- 
independent action So(q. I). If one bases this generating function upon 
the old positions q and the new momentum-like invariants / according to 
the template G(q,P,t ) of Eqs. (3.29) and (3.30), then the invariants / 
in S 0 (q, /) must be identified with the new momentum-like coordinates 
P = I in G(q,P,t ) = So(q,I). The remaining canonical coordinates are 
determined from the first two of Eqs. (3.30) in the form 

d d 

Pi= g^S 0 (q,I), a.i = —S 0 (q,I). 

Since cc* = u>it + (Si and uy = dH'(I)/dI-i, this result may also be expressed 
in terms of the full action S(q, /, t) as 

O A 

Pi = —S(q,I,t), Wi S(q,I,t)=Pi- (3-40) 

Equations (3.40) show how, given the invariants (/?, I) and the action 
S(q,I,t ), the primitive coordinates (q,p) are obtained. The position coor- 
dinates q are contained implicitly in the second of Eqs. (3.40); the momenta 
p are contained explicitly in the first. Further, since dG/dt = dS 0 /dt = 0, 
the third of Eqs. (3.30) shows that the Hamiltonians in the two sets of 
coordinates are the same; they are just expressed in different coordinate 
language: 



H'(I)=H(q,p). 
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The Hamilton equations and the Hamilton-Jacobi equation are differ- 
ent portraits of the same motion. In the Hamiltonian description, motion 
is cast in canonical coordinates ( q,p ) and a system of 2s ordinary differ- 
ential equations — the Hamilton equations — may be taken as the master 
equations of motion. One need not deal directly with the action itself [one 
can always find it by integrating Eq. (3.35) along the trajectories once the 
coordinates q(t),p(t) are known as integrals of the Hamilton equations]. 
In the Hamilton-Jacobi description, motion is cast in the action S(q,I,tf 
and the Hamilton-Jacobi equation, a single partial-differential equation in 
s degrees of freedom plus the time and which depends upon s constants of 
integration (plus one additive constant) is taken as the master equation of 
motion. From the solution of the Hamilton-Jacobi equation S(q,I 1 t), one 
can find the canonical coordinates ( q,p ) by differentiation as in Eqs. (3.40). 



Poincare Invariant 

The action is the central mechanical quantity in the orchestration of 
motion. Invariants are also basic quantities upon which motion rests. Is 
there a direct connection between action and invariants? There is: one of 
the invariants is always directly related to the action. It is the action on 
a closed contour in phase space and is known as the integral invariant of 
Poincare. 

From Eq. (3.35), the action can always be expressed as an integral over 
the canonical coordinates: 



S = J \pi dq t - H dt). (3-41) 

The action in the indefinite integral representation (3.41) is clearly not an 
invariant; but if instead of the indefinite integral (3.41), one considers this 
integral over any closed contour 7 lying in phase space, 

S = j) (pi dqi — H dt), (3.42) 

one finds it is invariant. The contour 7 itself is not invariant; it is deformed 
as it is swept through phase space by the flow; but the integral over this 
moving contour is always the same. This closed integral is the Poincare 
invariant. 

One should be aware that although the Poincare invariant exists for all 
motion, it is an integral of the canonical coordinates. It is inaccessible unless 
the canonical coordinates are known functions and the integrals can be per- 
formed. A known solution of the Hamilton or Hamilton-Jacobi equations 
is a prerequisite to an explicit representation of the Poincare invariant. 
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Illustration: Hooke Motion 



Let us now illustrate invariants, elementary flows, and the canonical 
transformation linking a particular flow to its elementary flow. We do 
so for the motion of a particle moving in one dimension and bound by a 
law of force proposed by Robert Hooke, Isaac Newton’s most prominent 
contemporary. The Hooke law of force corresponds to the potential 

V{q) = nq 2 / 2, (3.43) 

where n is a constant. It gives rise to an attractive force which is linear in 
the displacement, / = —dV/dq = —nq. 

The Hooke force does not correspond to any fundamental force in nature; 
but it does approximate the force between two bodies bound together by 
an elastic material such as a spring. (The actual forces in the spring mate- 
rial are electrical inverse-square forces between the nuclei and electrons of 
the atoms of which it is composed. The forces are inverse-square between 
any two charged nuclei or electrons; but the average force over myriads of 
charged nuclei and electrons for small displacements from their equilibrium 
positions turns out to be a linear force. This is because of the enormous 
cancellation of forces in a large ensemble of particles.) 

Hooke’s law of force leads to the Hamiltonian 

H(q, p) = p 2 /2m + nq 2 /2. (3.44) 

This is the Hamiltonian of a harmonic oscillator whose natural frequency 
is u>o = \J n/m. The canonical invariant I of this motion turns out to be 

I = H/loo (3.45) 

and its elementary flow, like all elementary flows, is given by Eqs. (3.32). 
The Hamiltonian in the elementary flow coordinates (a, I) is H'(I ) = ojqI, 
and the phase velocity of the elementary flow is 

u> = dH' /dl = uiq. 

To construct the flow in primitive coordinates one must have the gen- 
erating function of a canonical transformation from the elementary flow 
to the primitive coordinate flow. This generating function is the action 
S(q,I,t) = —£t + S 0 (q. I): and it is obtained by solving the Hamilton- 
Jacobi equation of the motion. This can readily be done; for the Hamilton- 
Jacobi equation (3.38) for one degree of freedom reduces to the ordinary 
differential equation dSo/dq = p(q). The Hamilton- Jacobi equation of all 
one-degree-of-freedom motions is integrable as the quadrature Sq = J pdq. 
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The momentum p{q) may be taken from Eq. (3.44) with H eliminated in 
favor of I according to Eq. (3.45), thereby giving the action 

So(q,I) = j \/2muj 0 (I - m.LU 0 q 2 / 2) dq. 

The Poincare invariant is just this integral over a closed contour in the ( q,p ) 
plane: S = §^pdq. This contour integral can be shown to be 

5 = 2t tI. 



The action So(q, I) generates a canonical transformation in the G(q , P, t) 
format of Eqs. (3.29) and (3.30) with G(q,P,t ) = S(q,I,t) = — £t + So(q, I) 
and with coordinates P(q,p) = / and Q(q,p) = a. From Eq. (3.30) one 
finds 



dS r- /T ^ 7 — dS 

p = — = s/2 mu 0 (I - mu 0 q z /2), a = — = cos 



1 (^s/rruJsjMqj . 



These relationships show that the primitive coordinates (q,p) are obtained 
from the elementary flow (a, I) as 



q(a,I) = \J 21 /mu>o cos a, p(a,I) = y / 2/rnwo sin a, 



with the inverse relationships 

a(q,p) = taiP 1 (p/mu 0 q), I(q,p) = {p 2 /2m. + nq 2 /2 )/uj 0 . 

The invariant / = H/tu 0 is a single- valued function of ( q,p ). The other 
half of the invariant set is (3 = a — uit or 

f3(q,p) = t&iP 1 (p/miooq) - oJot, (3.46) 

and is seen to be a many- valued but periodic function of ( q,p ) with period 
2tt. 

The harmonic oscillator of Hooke motion has a unique phase space — it 
is Euclidean as shown by the metric condition embodied in its Hamilto- 
nian (3.44) and Fig. 3-11. The canonical transformation of the elementary 
flow can be seen to be a transformation from rectangular to cylindrical 
coordinates in this Euclidean space. 

The flow of Hooke motion in phase space consists of concentric circles 
in the {q,p) plane whose radii are the invariants I. The projections of the 
invariant sj I on the q and p axes give the flow in primitive coordinates. 
These projections are specified by the angle coordinate a whose phase is 
fixed by the invariant (3. 
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Figure 3-11. Phase Space of the Hooke Oscillator. The elementary flow consists of con- 
centric circles whose radii are the invariants VI and whose angular configurations are 
fixed by the coordinate a. The canonical transformation of the elementary flow to primi- 
tive coordinates consists of projections of the invariant VI upon the primitive coordinate 
axes q , p — a cylindrical to rectangular coordinate transformation in this Euclidean space. 

The Complexity of Motion 

The elementary flow is simple. It removes the complexity of motion from 
the Hamilton equations and relocates it in the action of the canonical trans- 
formation which builds the motion in primitive coordinates. But whence 
comes the action S{q,I,t)l The action is the solution of the Hamilton- 
Jacobi equation — an equation which contains all the complexity of the 
Hamilton equations and whose solution is obtainable with a degree of dif- 
ficulty equal to that of solving the Hamilton equations in primitive coordi- 
nates. 

Although nature expresses the trajectories of motion from their under- 
lying Hamilton or Hamilton- Jacobi equations with ease, we who seek to 
form an image of motion have a more difficult task expressing this image. 
Finding the integrals of the Hamilton equations in primitive coordinates 
or the solution of the Hamilton- Jacobi equation for the generic motion of 
many bodies turns out to be a forbidding task. 




Figure 3-12. Complex Motion. Many primitive coordinate pairs (q,p) are entangled 
within each invariant manifold I, (q. p) of a complex motion. 
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The motion of many bodies is complex. This means it is impossible to 
exhibit the solutions of their Hamilton or Hamilton- Jacobi equations even 
as transcendental functions of the primitive coordinates — functions of great 
mathematical generality consisting of infinite series of algebraic polynomi- 
als. (We shall see why this is so in Chapter 7.) Numerical integration is 
the only generally available method for finding these integrals. 

Since each of the s elementary flow pairs (cq, If) is simple, the complexity 
of motion arises from the action S(q, I , t) which links them to the primitive 
coordinates ( q,p ). A typical invariant /; of a complex motion is linked to not 
one but all s pairs of the primitive flow coordinates (g,p) in its canonical 
transformation as indicated in Fig. 3-12. The fact that many primitive 
coordinate pairs ( q,p ) are entangled within each invariant manifold Ii(q,p) 
makes the generic motion of many bodies complex. 

Motions so structured by their symmetries that each invariant can 
be built from only one corresponding primitive coordinate pair {qi,pfj are 
freed of their complexity. The primitive coordinates do not entangle within 
the invariants and are said to be separable , as in Fig. 3-13. When this hap- 
pens, the Hamilton equations in primitive coordinates will be found to be 
integrable by quadrature, as was the case for the Hooke oscillator (we shall 
shortly describe how this happens more generally). Separable motions are 
also known as integrable motions and the two notions are interchangeable. 
Integrable motions lack complexity because the simplicity of the elementary 
flow is preserved in the canonical transformation to primitive coordinates. 




Figure 3-13. Separable Motion. In contrast to complex motion, only one primitive 
coordinate pair (qi,Pi) is linked to an invariant manifold p, ) of a separable motion. 

Symmetry competes with the degrees of freedom of a motion to deter- 
mine its integrability. Symmetry imposes order which separates the co- 
ordinates among the invariants; increasing degrees of freedom dilute that 
order allowing some of the coordinates to entangle. Two-body motion is 
so strongly structured by its symmetries that it is an integrable motion 
whereas the motion of many bodies is not. Though subject to Galilean 
symmetries, the motion of more than two bodies has sufficient degrees of 
freedom to evade the order imposed by the Galilean symmetries. 

The separation of the coordinates by symmetry has a corresponding im- 
pact on the way in which the flow fills the phase space. Symmetric flows 
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possess sameness ; they are made of repetitive patterns. Less information 
is therefore required for the description of a symmetric flow than for a non- 
symmetric flow. One can think of each point of phase space as a repository 
of information about the flow. (The position coordinates tell which points 
presently experience a flow and the momentum coordinates point out the 
adjoining points which experience the same flow.) The maximum informa- 
tion that can be packed into phase space occurs when every point is used 
by the flow. A symmetric flow which contains repetitive patterns does not 
require the full information storage capability of phase space. Some points 
are surplus and need never be visited by the flow. A highly symmetric 
flow need arrange itself over very few points of phase space. A flow lacking 
symmetry is so laden with information that it consumes the entire phase 
space. 

One is accustomed to the Galilean invariants which are single-valued 
functions of the primitive coordinates; but generic motions possess a full 
range of invariants that include but also go beyond the Galilean invari- 
ants. The additional invariants beyond the Galilean invariants for complex 
motions are many-valued functions of the primitive coordinates. Complex 
flows have many-valued invariants because they wander over large num- 
bers of points of phase space. Since each flow is identified by an invariant, 
the invariant I = consti of a complex flow is visited over large volumes 
of points (q,p) as the flow wanders about. Additional flows defined by 
/ = const 2 , const^, . . . also distribute their invariant values over large vol- 
umes of (q,p). 

In a finite phase space with incompressible volume it is inevitable that 
flows — defined by different invariant values I = consti, consti, ■■ ■ which 
each occupy large volumes of points in a finite region of phase space — will 
intersect. (This is another implication of the incompressibility of the phase 
flow.) Points (q,p) within the intersections correspond to many values of I. 
Flows which consume large regions of phase space intersecting one another 
form complex tangled webs. 

If a motion is sufficiently lacking in symmetry, a given point ( q,p ) may be 
swept by an infinite number of flow surfaces defined by the invariants I = 
consti , const 2 , . . . . The invariants of motion whose flow consumes every 
point of a region of phase space (such a flow is said to densely fill the phase 
space) are infinitely many- valued functions of ( q,p ). In contrast to complex 
flows whose invariants are many-valued, an integrable motion has invariants 
I{q,p) which are single- valued functions of the primitive coordinates. 



The Simplicity of Motion 



The counterpoint to complex motion is integrable motion. Integrable 
motion possesses symmetries which separate the coordinates: each invariant 
Ii is related to only one primitive coordinate pair ( qi,Pi ) as in Fig. 3-13. 
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Symmetry renders motions separable and separability in turn leads directly 
to integrability. 

A separable symmetry splits the Hamilton- Jacobi equation into parts, 
each of which is an invariant depending upon a single primitive coordinate 
pair. This occurs when a symmetric motion is cast in a set of coordinates 
qi = qi(x a ) which possess that very symmetry. Any coordinate system 
formed by independent lengths or angles may be utilized to specify a point 
in space. Each coordinate system also embodies particular symmetries. 
As the most elementary example, the rectangular coordinate system has 
planar symmetries formed by mutually orthogonal sets of planes X\ = const, 
X 2 = const, X 3 = const. Other coordinate systems also exist which possess 
particular symmetries such as spherical, cylindrical, conical, and others. 

In a coordinate system whose symmetries match the symmetries of the 
motion, the Hamilton- Jacobi equation — a partial differential equation in 
s = 3 n dimensions — splits into s ordinary differential equations whose in- 
tegrals are quadratures. The whole action of the motion S(q,t) separates 
into a sum of “partial actions” S* (qf), each of which depends upon only one 
position coordinate, 

So (q) = ^Siiqi). 

i 

The momenta are given by pi = dS/dqi = dSo/dqi, or 

Pi = d/dqi '^2 Sj(qj) = dSi{qi)/dqi 
3 

and each momentum coordinate is expressible as a function of only its 
canonical mate Pi{qf). The partial actions are therefore integrable by 
quadrature, 

Si(qi) = j Pi(qi) dqi (3.47) 

(no implied sums are intended in the above expressions). 

For bound motion a particular coordinate pair ( qi,Pi ) can only take 
values in a finite range; but the time for the motion is not bounded. The 
coordinate pair must, over the run of time, eventually retrace the same 
points again and again. Each action integral S^qf) therefore consists of 
repetitive increments, increasing by the same amount A,S', every time a 
complete cycle is completed by the coordinate pair ( qi,Pi )• The action 
integrals (3.47) for bound integrable motion are therefore infinitely- many- 
valued functions of the primitive coordinates: any S-fq) + nAS) for n = 
0, ±1, ±2, . . . corresponds to the same value of q. 

The Si(q.i) are almost invariants, but not quite; they do not satisfy the 
conservation law (3.33) because of their continual increase with time. More- 
over, they are many-valued (actually, infintely-many-valued) functions of 
the primitive coordinates. Nonetheless, in a single stroke one both turns 
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the Si into invariants and makes them single- valued. One does so by defin- 
ing a single-valued quantity that contains all the information contributed 
by that coordinate pair as 

Ji = (27t)~ 1 AS'; = (27r) _1 j) pidgi , (3.48) 

where § indicates integration over one cycle of the motion and the factor 
2-7 t is inserted for later convenience. 

The quantities Ji are not only invariant; they are single- valued invariants 
known as action invariants. The invariants / = ( Ji, J 2 , • • • , J s ) are the 
invariant elementary flow coordinates made single-valued functions of the 
primitive coordinates.* The a of Eqs. (3.32) are their canonical mates and 
the trajectories (3.32) for integrable motion are 

ccj = u>it + j3i, Ji = const, 

with uJi = dH/dJi . 

A close connection exists between the Poincare invariant S = j> y~b pj dqi 
and the action invariants Ji = (27t)^ 1 f pi dqi (no sums); and it is important 
to recognize their similarities and differences. The Poincare invariant is 
a sum over all the coordinates; each action invariant involves only one 
coordinate pair. The Poincare invariant exists for all motion, integrable or 
not; the action invariants exist only for integrable motions. Both invariants 
are integrals over closed contours; but the contour for the Poincare invariant 
can be any closed contour in phase space whereas the contour for the action 
invariants is not arbitrary. It lies on a phase trajectory over a complete 
cycle of periodic motion. The contour of the Poincare invariant need not 
be oriented in any special way just as long as it is closed. It therefore will 
generally be swept through phase space by the flow. But the contour for an 
action invariant lies along the flow. It remains stationary in phase space. If 
the motion is integrable and one chooses identical contours for the Poincare 
invariant and the action invariants, then the Poincare invariant is seen to 
be a sum of the action invariants: 

S = 27r( Ji + J 2 + ■ • • + Js)- 

The reduction of the many-valued invariants generic to nonintegrable 
motion to single-valued invariants is a sine qua non of integrable motion. 
The invariants of a complex motion are many-valued functions of the prim- 
itive coordinates. A many-valued function of a complex motion illustrated 
in Fig. 3-14 (a) is steadfastly many- valued. Its many- values occur without 
order and there is no way one can construct a non-trivial single- valued func- 
tion from it. The lack of order in the many-values makes such a function 
complex. 

* To see that the Ji are invariant, recognize that the contour of integration lies along 
the flow and therefore remains stationary. One can then calculate J, = <f>(pidqi + Pidqi) 
(no sum), integrate the term f p,d.q, by parts with the result j, = f (p, dq, — qjd'Pi). 
This result may be arranged to A Ji = Jidt = j>(dpidqi — dqidpj) = 0. 
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Figure 3-14 ■ Complex and Simple Many-Valued Functions. A complex many-valued 
function I{q) lacks order in the occurrence of its many- values (a). A periodic many- 
valued function S(q) is not complex because its many- values occur in a simple order (b). 
The whole function is created by repeating this simple pattern 



But if a many- valued function like the partial action Si(q) happens to 
be periodic, its many-values will occur in an ordered fashion, as shown in 
Fig. 3-14 (b). A periodic many- valued function is not complex. It is pe- 
riodicity that distinguishes the many- valuedness of integrable motion from 
the many-valuedness of complex motion. Each many-value is simply an in- 
teger multiple of a single- valued function. It may therefore be made single- 
valued, as in the method of Eq. (3.48). Many-valuedness that is periodic 
takes motion from the realm of complexity and makes it simple. 

Since the form of the action is known in Eqs. (3.47) and (3.48) for in- 
tegrable motion, the topology of the elementary flow of bound integrable 
motion is known. Each action J* is a “radius” of its circle (one-torus) and 
the canonical mate a is an angular coordinate on this circle. The torus of 
the complete motion is a Cartesian product of all these circles. Moreover, 
since the coordinates are separable, each pair of elementary flow coordi- 
nates S.j = (a.j, J-i) and its torus are linked to only one primitive coordinate 
pair = (qi.Pi) and its two-dimensional manifold (rather than to all of 
them as in the non-integrable case). The submanifolds are therefore not 
glued up into complex topologies in building up the complete phase flow 
but are rather “tori all the way up.” 

The invariant manifold for one integrable mode is a circle for which the 
amplitude is the invariant radius J,; and the angle spins at an invariant 
frequency uii as shown in Fig. 3-15 (a). A second integrable mode conflated 
with the first spins the first circle around a second center with the radius 
of the second action invariant and with an angle that spins at the second 
frequency creating the 2-torus as shown in Fig. 3-15 (b). Conflations with 
the third and higher modes create higher-dimensional tori which cannot be 
visualized in three-dimensional space [imagine the 2-torus being spun round 
like the circle to create a 3-torus as indicated in Fig. 3-15 (c); the 3-torus 
spun again to create a 4-torus, and so on]. The trajectory flow swirls upon 
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the surfaces of these tori. 

The flow lines of the composite flow may or may not close upon them- 
selves. If they close, then there are points on the surface never visited by 
the flow. If the streamlines are open, the flow covers every point on the 
torus. (Although the flow densely covers the torus, it does not densely 
cover the phase space because, being the flow of an integrable motion, it is 
confined to the surface of the torus.) 






Figure 3-15. Tori as the Invariant Manifolds of Integrable Motion. A single integrable 
mode generates a circle whose radius is the action invariant J and whose angle a spins 
at the mode frequency (a). A second mode conflated with the first rotates the first 
circle on a radius equal to a second action invariant thereby creating the torus (b). 
Conflation with the third mode creates a torus with a torus [it cannot be visualized in 
three-dimensional space; Fig. 3-15 (c) is merely suggestive]. The complete manifold is 
an s = 3n-dimensional torus. 



Integrable motions are exceptional. A phase space of invariant tori, the 
signature of integrable motion, is likewise exceptional. Two-body motion 
is one of these exceptional motions. All two-body motion (no matter the 
force law) is integrable. Aside from the motion of a free particle, whose 
separability is trivial, two-body motion is the only many-body motion that 
is rigorously integrable. 

There exist idealized many-body situations in which integrable motions 
can also be found. The most important of these is the rigid body — an 
ensemble of arbitrary numbers of particles rigidly linked to one another. 
Rigidity is the key to separability, removing all the degrees of freedom of 
the motion except six which separate into six of the Galilean invariants. 
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Integrable motion is highly specialized. One is therefore delightfully 
surprised to find that integrable motion describes much of the essential 
features of the world including the solar system and the atom. Moreover, 
integrable motion offers more insight into the generic motion of many bodies 
than one might expect from its specialized nature. This is because most 
many-body motion may be represented as a perturbation of many isolated 
two-body motions. In this sense, all many-body motion lies close to a 
collection of integrable two-body motions whose invariant tori have been 
distorted or broken by one another’s presence. Integrable motions and their 
invariant tori subjected to small perturbations are therefore windows into 
many-body motions which reveal properties which lie well beyond those of 
two-bodies. We shall see how this occurs in Chapter 7. 



Ultimate Simplicity 

Bound, integrable motions consist of an ensemble of independent oscillat- 
ing modes, each mode defined by a pair of canonical invariants, Xi = (Pi, Jf) 
which fix its amplitude and phase. Each mode has a phase velocity (“fre- 
quency”) oJi = dH/dJi and its motion is an identity transformation of the 
elementary flow a,; = + Pi, Ji = const. 

Although integrable motions lack complexity, they nonetheless can ex- 
hibit structure rich enough to be superficially mistaken for complex mo- 
tions. This is because although each oscillator of action ,S', supports a 
periodic motion with frequency u>i, the ensemble action Sq = JA S', built 
from them is in general aperiodic: the separable modes of the motion are 
completely unsynclrronized and the motion as a whole never repeats itself. 
The fact that an ensemble of periodic motions is not generally itself peri- 
odic is an important generic property of integrable motion; and it is this 
aperiodicity that gives such motions an illusory semblance of complexity. 

This property is illustrated in Fig. 3-16 where one regards each separable 
mode as the motion on a clock face, each clock hand rotating at the mode 
frequency. If all the motions are initiated with the hands pointing at high 
noon, the ensemble of hands will generally never simultaneously return to 
this position (or any other) for arbitrarily given frequencies uii (although 
each individual clock will return to its initial position) . 

A significant change in the character of integrable motion occurs when 
two or more of its mode frequencies uii are integer multiples of some funda- 
mental frequency uiq- (The frequencies are then said to be rationally com- 
mensurate.) When this happens, the motion is said to possess a degeneracy. 
If all frequencies are integer multiples of some fundamental frequency, the 
motion is fully degenerate and becomes periodic. (For s degrees of free- 
dom, an ensemble may possess up to s — 1 degeneracies, the limit at which 
it is fully degenerate.) There exist times at which the hands of rationally 
commensurate clocks all return simultaneously to the same position as il- 
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Figure 3-16. Aperiodic Motion from Periodic Motion. Each mode of an integrable 
motion may be represented as a clock face. Motion initiated with all modes at high noon 
(a) will generally never return to this state (b). When the frequencies are rationally 
commensurate t Oj = Ijioo with lj an integer and cjo the lowest frequency, the motion 
becomes periodic. The initial state is repeated after intervals of time t = 2ir /ujq (c). 

lustrated in Fig. 3-16 (c). The difference between two such successive times 
is the period of the ensemble. 

An integrable motion is generally aperiodic with structure rich enough 
to support flows which densely cover its invariant tori. But if an integrable 
motion possesses a degeneracy, its structure is further simplified and the 
flow no longer densely covers all its tori. Portions of the motion now become 
repetitive. If the motion is fully degenerate, the flow acquires its maximum 
simplicity and consists of streamlines which close upon themselves. 

Degeneracy is a critical distinction in integrable motion. If no degeneracy 
exists, the motion is aperiodic; but when degeneracy is present, correspond- 
ing parts of the motion become periodic. At full degeneracy, the motion 
becomes fully periodic. Generic integrable motion is therefore said to be 
quasiperiodic : it can be either aperiodic or periodic depending upon its 
degeneracy. 

The role of degeneracy in integrable motion may be further illuminated 
by representing any mechanical quantity F(q,p) as a Fourier series in the 
angle variables on = to it + ftp. 

OO 

F(q,p)= J2 (3.49) 

mi ,m 2 ,...,m s =-oo 

The Fourier coefficients F( J) mi , rn2 ,...,m a describe the way in which the am- 
plitudes of each integrable mode are combined together so as to represent 
any particular motion. The dependence of F on ( q,p ) is given through the 
canonical transformations (3.40) of /3 = f3(q,p), J = J(q,p). 

Let us use the shorthand notation m = (mi, m 2 , ■ ■ ■ ,m s ) for the s sets 
of integers in the Fourier expansion as well as F m (J) = F( J) mi , m2> ..., ms for 
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the Fourier amplitudes. The ensemble frequency corresponding to any set 
of integers m is 



= miWi(J) + m 2 to 2 {J) + • • • + in s u) s (J). 



The corresponding phase of the ensemble is 



( i>m(0 ) = mi(3i + m 2 (3 2 H 1- m s /3 s . 

One may therefore express the angle variables in terms of their frequencies 
and phases in Eqs. (3.49) with the result 

OO 

F(q,p) = E 

m =— oo 

The mechanical quantities of an integrable motion are a sum of oscillations 
of frequencies and phases (f> m {P )• The canonical pair of invariants 

thus enter into the motion in distinctly different ways. The action invariants 
J fix the frequencies il rn (J) while the invariants f3 fix its phases </> m (/3) 
(hence the name “phase invariant”). 

One can now observe that if ratios of the frequencies w.j are not rational, 
the ensemble frequency Ll m will not be a rational multiple of some funda- 
mental frequency loq. The quantity F(q,p) will then be aperiodic. This is 
the generic case of integrable motion. 

Suppose now two frequencies are rationally commensurate in the form 

u>i = Zia>0j u>2 = ^ 2 ^ 0 , (3.50) 

where l± and l 2 are integers. These two frequencies now appear in the 
ensemble frequency fl m in the form 



+ m 2 u> 2 = (toi?i + m 2 l 2 )u> 0 . 

But now a remarkable thing has happened. The coefficient ( mill + in 2 l 2 ) 
is itself an integer. Since the sums in Eq. (3.49) run over all the integers, 
one can just as well write 



m! = m,ih + m 2 l 2 



and sum over the single set of integers m' rather than the two sets mi and 
m 2 . The existence of a degeneracy has reduced the number of Fourier sums 
and independent frequencies of the ensemble from s to s — 1: 

where f l' m consists of the s — 2 nondegenerate frequencies. The collapse of 
two infinite sums into one is the origin of the term “degeneracy.” 
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The degeneracy of the frequencies also shows 

dH/d{hJi) = dH/d(l 2 J 2 ) = u 0 . 

When a degeneracy exists between two frequencies such as u>\ and lo 2 , the 
actions J\ and J 2 appear in the Hamiltonian H(J) in the rationally com- 
mensurate form (l\J\ + l 2 J 2 ). The converse is also true. If a Hamiltonian 
depends upon the action invariants in the form {l\J\ + l 2 J 2 ), then the mo- 
tion possesses a degeneracy and has one less independent frequency. 

Each additional appearance of a frequency in the form u>k = /fcWo> where 
Ik is an integer, signals the appearance of an additional degeneracy. When 
all frequencies are rationally commensurate the motion is fully degenerate 
with only a single fundamental frequency u>o and a single set of integers, 
m! = Zimi + l 2 m 2 + • • • + l s m s . The Fourier representation degenerates to 
the single sum 



OO 

F(q,p)= J2 F m'U 3,J)e im ' Wo(J)t . 

m' ——oo 



The existence of degeneracy also has an important impact on the phase 
invariants. The phase invariants (3 are generally many-valued. Although 
many-valued, the phase invariants are periodic. Each f3j increases by 2ty 
every time a coordinate pair ( qj,Pj ) returns to the same state as in the 
example of the Hooke oscillator, Eq. (3.46). The many-valuedness is there- 
fore not complex; one may construct a single-valued function by taking a 
trigonometric function of the phase. Thus, the many-valued periodic in- 
variant (3j may be made into a single- valued invariant: 




Although each Bj, considered in isolation, is a single- valued function of 
its phase (3j, the various fij are functions of {q,p). Hence, the Bj considered 
as functions of ( q,p ) cannot in general be single- valued because cycles of 
(q,p) induce cycles in /3 which are not in synchrony. If the motion pos- 
sesses no degeneracies, one may construct only one additional single- valued 
invariant, say B\ = e*^ 1 . One single- valued phase invariant always exists. 

On the other hand, if a degeneracy (3.50) exists between u>\ and u> 2 , 
then the cycles of f3\ and f3 2 come into synchrony. The invariant B 2 is 
then also single-valued if Bi is single-valued. For each degeneracy given by 
ajk = IkW o, additional single-valued invariant Bk comes into being. 

To sum up, the action invariants Ji of an integrable motion are always 
single-valued functions of the coordinates; the phase invariants Bj = e l ^ j 
need not be. But if the motion is degenerate, the phase invariants also 
become single- valued. 
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The flow of a generic, nonintegrable motion fills the entire manifold of 
phase space. When motion possesses a symmetry that allows it to be in- 
tegrable, a single-valued action invariant Ji exists for the integrable mode 
corresponding to that symmetry (its corresponding phase invariant Bi gen- 
erally remains many- valued) . The flow no longer fills the full manifold of 
phase space of dimension 2 s = 6n. It now lies upon a submanifold -the 
union of a one-dimensional torus upon which the invariant is constant and 
the remaining 2(s — l)-dimensional phase space of the nonintegrable coor- 
dinates. 

Single-valued invariants are called isolating invariants. They confine the 
flow to smaller submanifolds, thereby isolating it from the full manifold of 
phase space. If the motion is fully integrable, then s single- valued invariants 
exist and the flow lies upon an s-dimensional torus. The flow of a fully 
integrable motion lies upon a manifold half the size of the manifold of its 
phase space. 

Can the flow be isolated to a yet smaller manifold? It can. When a fully 
integrable motion is also degenerate, the manifold upon which the flow lies 
will be further reduced. This is because the phase invariants also become 
single-valued when degeneracy exists; and it is the single-valuedness of in- 
variants that leads to isolation. In addition to the s single- valued invariants 
J a degenerate motion now includes the single- valued invariants B ; , one for 
each degeneracy. The flow correspondingly occupies a manifold of dimen- 
sion less than s, a decrease of one dimension for each degeneracy. If the 
motion is fully degenerate (a maximum of s — 1 degeneracies), all invari- 
ants are single-valued and the flow closes upon itself on a one-dimensional 
torus — the knotted circle (it is knotted because the streamline may wind 
more than once around the torus before closing). 

Flows may occupy a manifold of dimension as great as 2s or as small 
as one dimension depending upon their symmetries and the corresponding 
single-valued invariants they possess. It is this vast range in the dimen- 
sion of the flow manifold that spans the most simple to the most complex 
motions. 

Only s single-valued invariants are required to integrate the Hamilton 
equations (3.15). Integrable motions have at least this number of single- 
valued invariants constructed in Eq. (3.48). But when a motion is degen- 
erate, more single- valued invariants exist and the set exceeds s. One may 
therefore assemble more than one set of s invariants (the actions J ) in 
which the motion is integrable. This means there exists more than one set 
of coordinates in which the Hamilton- Jacobi equation is integrable. This 
is the case for two-body motion with inverse-square forces. 

In summary, integrable motion lacks complexity; nonetheless it exhibits a 
richness of structure that ranges from simple one-loop closed orbits to open 
orbits so dense and seemingly irregular they can be superficially mistaken 
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for those of nonintegrable motion. Isolating invariants and the degeneracy 
they induce distinguish the range of simplicity in integrable motion. A 
fully degenerate motion is periodic and its orbits are closed curves. A 
nondegenerate motion has the maximum richness of structure possible in 
integrable motion. It is aperiodic and its orbits wind densely over its torus 
without closing. 



The Algebra of Symmetry 

The Galilean symmetries of mechanics are transformations of space and 
time to which mechanical quantities which are functions of space and time 
such as the energy H , momentum P, mass-center IV, and angular momen- 
tum J are invariant. The configuration space in which the particles reside, 
however, is only half of the wider world of phase space in which motion 
fully exists. The full symmetries of motion are the symmetries of phase 
space. 

Phase space naturally possesses Galilean symmetries because it contains 
configuration space. But phase space also possesses another fundamental 
symmetry flowing from the action principle itself: symplectic symmetry. 
The fundamental symmetries of phase space are the Galilean and symplectic 
symmetries. Additional symmetries come into being with specific motions. 
They are embodied in the particular Hamiltonians of those motions. They 
represent additional transformations of phase space into itself to which the 
mechanical quantities defined on it are invariant. 

How are mechanical quantities imprinted by a particular symmetry? Re- 
markably, symmetries reveal themselves in the algebraic behavior of these 
quantities, in particular, in the behavior of products of mechanical quanti- 
ties. This mechanical algebra, inspired by the Norwegian mathematician 
Sophus Lie at the end of the nineteenth century, describes what are called 
the generators of a symmetry. The generators are matrices and correspond 
to infinitesimal actions of the symmetry. For example, three-dimensional ro- 
tations through the angles Sf2, illustrated in Fig. 3-3, have the infinitesimal 
transformation x' = 6f2xx which can be expressed as, x[ = Xi+ekijSflk Xj. 
The alternating symbols appearing here can be arranged as three 3x3 ma- 
trices that are the generators of this transformation: 

( d \ ) i'j = C | , ( d‘2 ) ij — C‘2ij - ( 'A ) ij ~ t3ij ■ 

An infinitesimal rotation expressed in terms of generators is then 

Xj - \S, : j A (d k ) i'j Ah,:] Xj , 

and a finite rotation (see the footnote on p. 30) is effected by exponentiating 
the generator: 

A Jk)ij£lk ~ . 

it/ n C- U/ 1 . 
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The key to the Lie algebra is that it is non-commutative and the commu- 
tator of products of the generators contains the essential structure of the 
symmetry. For example, the commutator of the three-dimensional rotation 
generators is readily shown to be 

( J%Jj JjJi ) ^i.jkJk 



The Lie algebra, which describes the generators of the symmetry, reflects 
itself in a similar commutator for the invariants associated with the sym- 
metry. That parallel structure to the Lie commutator of generators is the 
Poisson bracket of the corresponding invariants of the symmetry. Whereas 
the Lie generators of a symmetry are matrices, the invariants of that sym- 
metry are dynamical variables of the motion. Angular momentum is the 
invariant corresponding to rotational symmetry. It possesses a Poisson 
bracket which parallels the Lie commutator of the rotation generators: 



[ Ji, Jj\ 



/ dJi dJj dJj dJi\ 

V 9qk dp k dpk dq k ) 



tijkJk- 



Hence, the structure of the Poisson bracket commutator is a manifestation 
of the symmetry originating in the Lie commutator. The Lie commuta- 
tor acts on the generators of the symmetry. The Poisson bracket acts on 
the invariants of the symmetry and, like the Lie commutator, reveals the 
structure of the symmetry: 



Lie bracket — > symmetry generators, 

Poisson bracket — > symmetry invariants. 

Products of mechanical quantities of interest may be arrayed in a “mul- 
tiplication table” in which the product is defined by the Poisson bracket. 
This multiplication table is a portrait of the corresponding symmetry. By 
examining the patterns of its products, one discerns the structure of the 
symmetry. For the elementary Galilean symmetries of mechanics, the in- 
variants of interest are H, P , N , and J, and their multiplication table 
is 





H] 


Pj] 


Nj] 


Jj] 


[H, 


0 


0 


P, 


0 


[Pi, 


0 


0 


Sijm 


£ ijkPk 


[Ni, 


- Pi 


—8ijm 


0 


£ij kN k 


[Ji, 


0 


^ij k *k 


^ij k Af ; 


€ ij k Jk 



Four symmetries are contained in this table, each symmetry carried by 
a particular invariant. They are translations in time ( H ), translations in 
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configuration space (P), translations in velocity space (IV), and rotations 
(J). Products of the components of an individual invariant vector, such 
as [ Ji , Jj\, describe the symmetry carried by that invariant and lie on the 
diagonal of the table. Products of the components of invariant vectors 
carrying different symmetries, such as [Pi, describe the joint action of 
the two symmetries when they are combined and form the off-diagonal 
products. 

Time-translational symmetries described by H are simple. The products 
of all quantities with H are zero except for N for which it is — P. This is 
because N = Pt — mX is an explicit function of time. The Poisson bracket 
[N,H] = — P must cancel the time derivative dN/dt = P so as to yield 
d.N/dt = 0 according to Eq. (3.33). 

Translational symmetries in configuration space and in velocity space 
described by P and N are also simple. Their self-Poisson brackets on the 
diagonal vanish while their off-diagonal product vanishes unless both are 
components in the same direction, in which case their product yields the 
total mass of the ensemble. Translations in configuration space couple with 
translations in velocity space like canonical invariants. The invariants P 
and —N/m have a Poisson bracket behavior that is identical to that of the 
invariants I and (3. 

Rotational symmetry is described by that portion of the multiplication 
table containing the elements J,; and are richer in structure than transla- 
tional symmetry. Quantities possessing rotational symmetry, such as the 
angular momentum, have the cyclic Poisson bracket which, in contrast to 
the brackets for translations, does not vanish. A product of any two of 
the three elements J\, J 2 , J 3 is the third. Moreover, the magnitude of the 
elements of rotational symmetry has a vanishing Poisson bracket with all 
elements, [J 2 , Ji) =0. 

Translations and rotations are distinct symmetries. Nonetheless, they 
do couple in a general frame of reference as indicated by the nonvanishing 
off-diagonal products [. Pi,Jj ] and [. Ni,Jj ]. This coupling is superficial. It is 
eliminated in a frame of reference translating with the center of mass and 
known as the center of mass frame of motion. 

All motion may be decomposed into a center of mass motion described 
by P and N whose angular momentum is 

X x P = P x N jm 

and a relative motion observed in a frame of reference moving with the cen- 
ter of mass. The center of mass motion is free of forces and its total energy 
is the kinetic energy P 2 /2m. The energy TL and the angular momentum 
M of the relative motion are therefore 

H = H- P 2 /2m, M = J — P x N/m. 

The translations and rotations coupled in the Poisson brackets of H, 
P, N , and J in a general frame of reference uncouple into two pieces: a 
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center of mass component described by P and N and a relative component 
described by Tt and M. The center of mass components have the Poisson 
brackets 



[Ni, Nj] = 0, [N it Pj ] = -Sijm, [Pi, Pj } = 0. 

The center of mass motion is integrable and constitutes a six-dimensional 
phase space with elementary flow identical to the flow in primitive coordi- 
nates, 

Xi = (. Pi/m)t — Ni/m, Pi = const. 

Its invariants are X = ((3,1) = (— N/m,P ). The action of the center of 
mass motion is 

S = —£t + PiXi 

and is an identity canonical transformation of the elementary flow. 

The center of mass motion of all motions is always known. All the 
complexity of general motion rests in the remaining 6 (n — l)-dimensional 
relative motion whose Galilean invariants are Tt and M with the Poisson 
brackets 



[H, Tt] = 0, [Tt, Mi] = 0, [Mi,Mj] = e ijk M k . 

Symmetries create invariants; and the invariant of symplectic symmetry 
is the zero element of the Poisson bracket algebra. All mechanical quantities 
manifest the symplectic symmetry of phase space in the zero element: the 
product of a mechanical quantity with itself (i.e. , Poisson bracket of a quan- 
tity with itself) vanishes. Although the components of quantities like M do 
not have vanishing Poisson brackets among themselves, quantities may be 
constructed from them which have a vanishing Poisson bracket with every 
element of the algebra. If one restricts attention to the angular momentum 
describing three-dimensional rotational symmetry, the magnitude M 2 or M 
is such a quantity with vanishing Poisson brackets with all components as 
indicated by Eq. (3.19). Such a quantity is the key to the organization of 
the phase flow by symmetry and is known as a Casimir invariant after the 
twentieth century physicist H. B. G. Casimir who first distinguished them 
in his classic work on the mechanics of rigid body rotations. 

A Casimir invariant is a quantity that has a vanishing Poisson bracket 
with every element of the symmetry. For translations in configuration space, 
all the elements (Pi, P2, P3) are Casimir invariants as are (Ni, N%, IV3) for 
translations in velocity space. 

In the case of rotations none of the elements of the symmetry algebra 
is a Casimir invariant. The Casimir invariant of rotational symmetry is 
the relative angular momentum magnitude M. In contrast to J, it is also 
a Casimir invariant of translations since it has vanishing Poisson brackets 
with P and N. The relative energy Tt is easily seen to be a Casimir invariant 
not only of time translations but of all the Galilean symmetries. 
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When all the Galilean symmetries of mechanics are combined, only two 
invariants survive with vanishing Poisson brackets with all the elements H, 
P, N , and J. These Casimir invariants of the complete Galilean symme- 
tries are the relative energy TL and the magnitude of the relative angular 
momentum M; and they are possessed by all motions. Particular motions 
described by particular Hamiltonians possess additional symmetries beyond 
the Galilean symmetries. The Casimir invariants expand accordingly. 

In summary, symmetries create an algebra through the Poisson bracket 
as illustrated by the fundamental translational and rotational symmetries 
of mechanics. The algebra of rotations may be uncoupled from that of 
translations by viewing the motion in the center of mass frame. The center 
of mass motion is integrable; it is an identity transformation of the ele- 
mentary flow. The remainder of the motion which contains all its possible 
complexity is contained in the (s — 3 )-dimensional relative motion. For a 
given symmetry, special quantities exist that have vanishing Poisson brack- 
ets with every element of the algebra. These are the Casimir invariants of 
the symmetry. The complete Galilean symmetries of mechanics have two 
Casimir invariants, the relative energy TL and angular momentum magni- 
tude M (the energy and the angular momentum magnitude in the frame 
of motion moving with the center of mass). All motion possesses these 
Casimir invariants. 



A Summary of Symmetry in Invariant Sets 

Symmetry is captured in the Poisson bracket behavior of its invariants. 
Sets of quantities whose Poisson brackets collectively vanish summarize the 
symmetry. This is because they form one-half of a symplectic pair of coor- 
dinates as indicated by Eqs. ( 3 . 34 ) which allow the Hamilton equations to 
be integrable. We shall call sets that contain all the elements of a symme- 
try with collectively vanishing Poisson brackets invariant or Casimir sets. 
Casimir invariants are always members of this distinguished set. 

A Casimir set need not only consist of Casimir invariants. In the case of 
rotational symmetry, one may always select one of the three quantities Mi, 
M 2 , M 3 to place with the Casimir invariants M and TL to form a set whose 
Poisson brackets collectively vanish such as (TL, M, M3). The elements TL 
and M of this Casimir set are Casimir invariants; but the element M 3 is 
not. 

A given symmetry may have more than one independent Casimir set. 
The rotational symmetry of the relative motion turns out to have two. The 
independent invariants of the relative motion are (TL, M\, M 2 , M3). They 
may also be expressed in terms of the angular momentum magnitude M 
and any two components as (TL, M, M3, Mi). It can be easily seen that one 
may combine these invariants together in the two independent Casimir sets 



(TL,M,M 3 ), (TL,M,Mi). 
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If one restricts attention to translations, Casimir sets consist of the two 
complete sets 

(P 1 ,P 2 ,P 3 ), (N^N^Na). 

They form the pair of three-dimensional invariants of the ensemble trans- 
lational motion 1 = (— N/m,P ). 

Casimir sets contain all the contributions of a given symmetry to the 
invariant set I. The sets (Pi,P 2 ,P 3 ) and (Ni, N 2 , N 3 ) are each Casimir 
sets of the translational symmetry possessed by the center of mass mo- 
tion. The sets (7Y,M, M 3 ) and Mi) are the Casimir sets of the 

three-dimensional rotational and time-translational symmetry of the rel- 
ative motion. These Casimir sets for translations and rotations may be 
combined into Casimir sets for the complete Galilean symmetries, such as 

[(Pi,P 2 ,P 3 ),(W,M,M 3 )], [(P 1 ,P 2 ,P 3 ),(H,M,M 1 )}. (3.52) 

The full Galilean group allows the Casimir sets (3.52). All the Casimir sets 
one may form for the complete Galilean symmetries contain a maximum of 
six invariants, a number less than the full number of Galilean invariants, 
which is ten. 

The Hamilton equations have the property that they may be reduced 
to quadratures in pairs for each single-valued invariant which is known 
thereby allowing a reduction of two in the dimension of the phase space for 
each single- valued invariant. One might be led to conclude that since there 
are ten Galilean invariants, the dimension of the phase space of all motions 
may be reduced by twenty; but this is not so. The invariants that effect 
the reduction must form a set with collectively vanishing Poisson brackets. 
These are the Casimir sets; and they contain only six rather than the full 
ten of the Galilean invariants. The reduction in the dimension of the phase 
space of all motions is twelve rather than twenty. 

The fact that there exists more than one Casimir set for a symmetry 
means that observers have the choice of viewing the same motion through 
different sets of mechanical quantities. It also means that the same sym- 
metry is natural to more than one geometry. Rotational symmetry has two 
independent Casimir sets. It is natural to both the geometry of the sphere 
and the geometry of the paraboloid as we shall find. 

Casimir sets show how symmetries organize the canonical structure of 
motion. Symmetries create invariants; and the invariants of mechanics 
1 = (/?,/) are a symplectic pair, each half of which is a Casimir set. The 
flows of a Casimir set all lie on a common manifold of phase space. Casimir 
sets are the building blocks for constructing all integrable motions. If they 
had no other significance than this, Casimir sets would be amply distin- 
guished quantities in mechanics. But in the extension of motion into the 
quantum realm, Casimir sets take on deeper significance. Casimir sets are 
those quantities that can be simultaneously observed without quantum un- 
certainty. It can well be argued that the most important quantities in 
mechanics are the Casimir sets of the symmetries of nature. 
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Reprise 



Our journey with the law of motion began with space and time and culmi- 
nated in the principle of least action. The principle of least action blossoms 
with implications: conservation laws, invariants, the Poisson bracket and 
its Hamiltonian flow, and a revealing algebra of symmetry and invariants. 
We shall follow these implications in a detailed exposition of the two-body 
motions of heavenly bodies and the hydrogen atom. These are nature’s 
integrable motions whose behavior is completely open to us. And we shall 
find that the law of motion reflects the beauty of the physical world in the 
mathematical beauty of its own unfolding form. 




Chapter 4 



Classical Mechanics: The Heavens 



The universe is in motion. Least action is the principle underlying that 
motion. Let us look upon the revelations of this principle for the most 
elementary celestial motion, that of two heavenly bodies such as a planet 
and the sun. The trajectories of the bodies are the final flowering of the 
mathematical description. The action principle has provided their equa- 
tions of motion; however the trajectories do not become apparent until the 
integrals of the equations of motion are found. 

But the action principle provides more than equations of motion. It also 
offers conservation laws created by the symmetries of nature. The conserva- 
tion laws present us with the invariants of mechanics; and these invariants 
provide direct access to the trajectories of two bodies. The motion of two 
bodies may be decomposed into a center of mass component and a compo- 
nent relative to the center of mass. The invariants N and P tell us that 
no matter what complex motion each body executes, the center of mass of 
the two bodies translates at a constant velocity through the celestial space. 
The invariants N and P each constitute an appropriate Casimir set for the 
center of mass motion, each contributing three invariants. The trajectory 
flow of the center of mass consists of straight lines given by 

X = (P/m)t — N /m, P = const. 

Now turn to the relative motion. The energy of the relative motion TL 
is an invariant. We shall find that for energies TL < 0, the two bodies are 
bound together — they must always remain within a finite distance of one 
another. For energies TL > 0, the motion is unbound -the bodies may move 
infinitely far from one another. 

The angular momentum M of two bodies is also an invariant vector 
forever pointing in a fixed direction as the two companions stroll through 
the heavens. By definition, the position and the momentum vectors of the 
relative motion are both perpendicular to the angular momentum; they 
define a plane. The result is exceptional: the entire relative motion is 
confined to a plane perpendicular to the angular momentum vector. 
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Bodies in arbitrary motion may be expected to explore all values of posi- 
tion and momentum in phase space. This is not so in two-body motion. The 
angular momentum is an isolating invariant. It defines a plane permanently 
fixed in configuration space to which the relative motion is restricted. No 
matter how the initial conditions of the bodies may be varied for a given 
angular momentum, the trajectory flow always lies in the same plane. The 
isolating power of the angular momentum is reflected in the fact that it 
contributes not just one but two Casimir sets to the relative motion: 

(H,M,M 3 ), (W.M.M,). (4.1) 

The phase space of the relative motion is a six-dimensional manifold. 
The existence of one Casimir set with three single-valued invariants im- 
mediately shows the motion is integrable and its phase flow confined to a 
submanifold of three rather than six dimensions. But the fact that the an- 
gular momentum contributes two Casimir sets rather than one means the 
motion is degenerate and restricted to an even smaller submanifold of two 
dimensions. When the motion is bound, this submanifold is a two-torus. 
Angular momentum in concert with the energy isolates the trajectory flow 
within a manifold of six dimensions to one of two dimensions — a powerful 
reduction. 

The bound trajectories lie on a plane in configuration space and upon 
a two-torus in phase space; but they are still quite open. The trajectories 
form generally wandering orbits, never closing upon themselves as shown in 
Fig. 4-1 (a). A closed orbit for which the bodies retrace the same streamline 
as shown in Fig. 4-1 (b) represents another degeneracy of the motion. Such a 
trajectory is bound to an even more severely restricted region of space. Not 
only is the motion confined to the plane; the trajectory is closed restricting 
the motion from the full space of the plane. 

If the trajectories of two-body motion are to be closed, there must be an 
additional single-valued invariant beyond Ti and M underlying this degen- 
eracy that fully isolates the trajectory. Is there such an isolating invariant? 
In general there is not. Additional isolating invariants that close the orbit 
do not exist for arbitrary force laws and the orbits are generally open. There 
is one, and only one, inverse-power force law that endows two-body motion 
with another isolating invariant that closes the orbit (Note 2). That force 
is the inverse-square force. 

This coincidence is one of the compelling correspondences we shall find 
in our journey with the two-body problem. Inverse-square forces are special 
forces indeed. They are precisely the forces of gravitation and electricity for 
which bodies move slowly compared to the speed of light. Inverse-square 
force orbits, the orbits of two particles bound by electrical and gravitational 
forces, are closed; the motion is completely degenerate. 

The trajectories of two bodies bound by the inverse-square force are 
closed orbits lying in a plane. What of their shape? One is drawn by a 
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Figure f-1. Closed and Open Trajectories of Two-Body Motion. The trajectory is 
confined to the torus in phase space and the annulus r m i n r ^ r ma x in configuration 

space. In (a), the orbit never closes upon itself. Over the course of time the trajectory 
densely covers the annulus. In (b) the trajectory is closed. Most of the surface of the 
annulus is untouched by the trajectory. 




Figure f-2. The Family of Conic Sections. A plane cuts a cone in the shapes of circle, 
ellipse, parabola, and hyperbola. 



classical sense of perfection to the circle. But the general trajectories of the 
relative motion in the plane cannot be circles; for once one has fixed the 
orientation of the plane of motion by the angular momentum vector, two 
Galilean invariants, the magnitude M and the energy TL, remain. A circle 
possesses only one invariant, its radius. 

Enter the ellipse. It is a closed, smooth curve one step beyond the circle. 
It has two invariants that match the physical invariants TL and M, a major 
axis and a minor axis. But let us not rush into ellipses just yet. There are 
other shapes as well — the full set of conic sections — that satisfy the laws of 
two-body, inverse-square force motion. Conic sections are the curves that 
form the intersections of a cutting plane with a cone as shown in Fig. 4-2. 
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Two-Body Coordinates 



Before proceeding further, it is useful to introduce a natural set of coor- 
dinates for the relative motion of two bodies. The total mass of the bodies 
to and their reduced mass y are 



TO = TOi + 777-2, y 1 = 1 + TO- 2 , 

where in\ and TO 2 are the masses of the two bodies. The relative position 
vector r of body 2 with respect to body 1 and its corresponding relative 
momentum are 

r = x 2 - x-| , p = yr. 

The relative angular momentum expressed in two-body coordinates is read- 
ily shown to be 

M = r x p. (4.2) 

The energy in two-body coordinates is 

Ti = p 2 /2y — k/r. (4.3) 

An unusual connection exists between the energy and the angular mo- 
mentum in two-body motion, a direct result of the existence of the angular 
momentum as an isolating invariant. The momentum may be represented 
as a sum of the component in the radial direction of the position vector 
p r = yr(r/r) and that in the direction perpendicular to it, p±: 



p = p r + p±. 



The radial direction momentum does not contribute to the angular mo- 
mentum so the angular momentum is expressible as M = r x p±. Since 
r and p± are orthogonal, a simple identity shows that the perpendicular 
momentum is contained within the angular momentum: 

p i = ~r x M/r 2 . (4-4) 

Since r and M are perpendicular, the magnitude of the perpendicular mo- 
mentum is given by 

p_ i = M/r. 

This result provides two-body motion a sweeping simplification; for the 
perpendicular momentum need not explicitly appear in any mechanical 
expression. It is contained within the invariant M. The perpendicular 
momentum can be represented as a pure function of r through Eq. (4.4). 
Thus, the total kinetic energy of the motion can be expressed in terms of 
the total angular momentum as 



T = p 2 r /2y + M 2 /2yr 2 , 
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where p r = pr is the radial momentum. This representation of T in terms 
of M 2 is a result of the isolating power of the angular momentum invariant. 
The total energy may be expressed in the compact form 

H=p 2 r /2p+W(r), (4.5) 

where W (r) is the effective potential defined as 

W(r) = M 2 /2pr 2 + V(r) = M 2 /2pr 2 — k/r. (4.6) 

The relative motion in the center of mass reference frame may be re- 
garded as an equivalent one-body problem of mass p, position coordinate 
r, linear momentum p = pr, and angular momentum M = r x p. In this 
equivalent one-body problem in the frame of relative motion translating 
with the center of mass, M and TL are invariants; but the relative linear 
momentum p = pr is not. 



The Hidden Invariant of Kepler Motion 

The isolating invariant that closes the orbit of two-body motion with an 
inverse-square force, f = —dV/dr = — kr/r 3 , is the eccentricity e: 

e = (p x M)/kp — r/r. (4-7) 

One may verify that the eccentricity obeys the conservation law de/dt = 0. 
This is done in Note 3. There it is shown that the Poisson bracket [e, 7 ~L] 
vanishes. Since e does not depend explicitly upon time, it is an invariant 
by Eq. (3.33). 

The eccentricity invariant was first recognized in the early eighteenth 
century by Jacob Hermann, a student and protege of Jacob Bernoulli and 
a favorite colleague of Leibniz. It was elaborated by Simon Pierre Marquis 
de Laplace in the Traite de mechanique celeste of 1799 and discovered, 
apparently independently, by William Rowan Hamilton again in 1845.* 
The eccentricity invariant was introduced into this century by Carl Runge 
in his Vektoranalysis of 1919. It has also been referred to as the Laplace 
vector and the Runge-Lenz vector. 

How is the eccentricity vector oriented with respect to the other fun- 
damental vectors of the motion: r, p, and M? Since r and p x M are 



* Hermann’s invariant may be found in “Extrait d’une lettre de M. Herman a M. 
Bernoulli, datee de Padoue le 12. Juillet 1710” published in Histoires de L'Academie 
Royale des Sciences a vec les Memoires de Mathematique et Physique , Paris, 519—523 
(1712). [For a brief survey of Hermann’s life and works, see F. Nagel, Historia Mathe- 
matica, 18, 36—54 (1991).] For an historical account of the eccentricity invariant see H. 
Goldstein, Am. J. Phys. 43, 8 (1975) and 44, 11 (1976) which includes P. S. Laplace, 
Traite de mechanique celeste , Paris, Tome I, Premiere Partie, Livre II, 165ff (1798—1799); 
W. R. Hamilton, Proc. R. Irish Acad. 3, Appendix No. Ill, xxxviff (1847). 
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each perpendicular to IVf, the eccentricity vector is also perpendicular to 
M and shares the plane of r and p as shown in Fig. 4-11. As an invariant, 
e remains permanently perpendicular to the invariant M ; but the angles 
r and p make with e in the plane that they share will vary because these 
quantities are not invariant. 

It may seem that the three- vector e brings three additional invariants to 
the motion; but this is not so. Since e is expressed in terms of M , these two 
vectors are not independent. The magnitude of e is completely expressible 
in terms of the other invariants H and M. In addition, the vectors M and 
e are orthogonal. They by nature satisfy the symmetry conditions 

e 2 = l+|^M 2 , M • e = 0. (4.8) 

k z fi 

Because of the two conditions (4.8), the eccentricity contributes only one 
additional independent invariant to the motion. 



The Symmetries of Kepler Motion 

The only symmetries present in all classical motions are the elementary 
symmetries of translations and rotations of space and translations in time. 
The only invariants present in all classical motion are the Galilean invariants 
that spring from these elementary symmetries. However, each configuration 
of particles and force law introduces additional symmetries and accompa- 
nying invariants. The inverse-square nature of gravitational and electrical 
forces for two bodies expands the symmetries of the Galilean invariants in a 
remarkable manner: the three-dimensional rotational symmetry generated 
by central forces (and its corresponding invariant, the angular momentum) 
is expanded to a four-dimensional rotational symmetry by the additional 
action of the inverse-square nature of the force (and its corresponding in- 
variant, the eccentricity). It need not be so and, indeed, would not be so if 
the force were not inverse-square. 

Put differently, the symmetries of two-body motion that exist indepen- 
dently of the law of force include three-dimensional rotations and their 
invariant is the angular momentum, the three-vector M. When one fur- 
ther specifies the force is inverse-square, a second three- vector invariant, the 
eccentricity e, comes into being also bearing rotational symmetry; but the 
rotations represented by these two vectors in concert is in four rather than 
three dimensions. (This does not mean the three-dimensional configuration 
space of the motion has become four-dimensional. The space in question is 
the phase space of the relative motion.) 

To see how four-dimensional rotations are the symmetries of two-body 
motion with inverse-square forces one must first grasp the nature of ro- 
tations in spaces of various dimension. The rotation of a vector £ = 
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(Ci j £2 > • • • 5 Cn) iii n-dimensional space consists of transformations 

C' = /?C 

that leave the magnitude of the vector unchanged: 




The matrix R that effects the rotations is an n x n matrix. The unique 
properties that make it a rotation matrix may be established by forming 
the magnitude 

c ' 2 = /?£/?£. 

The product R£ may be rearranged as /?£ = £R* where R' is the transpose 
of R obtained by interchanging its rows and columns. The invariance of 
the magnitudes then becomes 

C /2 = CR f fiC = C 2 

showing that the product of the rotation matrix and its transpose must be 
the identity* 

/?*/?=/. (4.9) 

The condition (4.9) is the fundamental condition that all rotation matrices 
must satisfy. In particular, one finds that the identity R = I satisfies 
Eq. (4.9) and is itself a rotation matrix. 

One may further draw out the structure of a rotation by considering 
rotations which depart from the identity by an infinitesimal amount: 

R = I + Sw, 

where <5 w <C 1. Substitution of the infinitesimal rotation into Eq. (4.9) 
shows that to first order, Sw + Sw' =0, or Sw is an antisymmetric matrix 
(“skew-symmetric” if one prefers a more refined term): 

Swij + Swji = 0 . 



Antisymmetry makes less than half the n 2 components of R independent. 
The diagonal consists of unit elements and the off-diagonal components are 
reflections across the diagonal. The number of independent components is 
therefore equal to the number of components lying either above or below 



* The argument in subscript notation: £) = R,-,- £,■ gives £' 2 = the trans- 

pose it- = allows the product of the matrix and the vector to be rearranged to 



R . .£ . = £ . f?t . from which follow £' 2 = £ R\ R - h £, = £ 2 and Rt.R.. = <5 . 

’7 7 ® ’7 7 1 IK^K ll IK IK 
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the diagonal which is n(n — l)/2. One finds an n-dimensional rotation is 
specified by n(n — l)/2 parameters. 

Rotations do not exist in spaces of one dimension. In two-dimensional 
space one parameter specifies a rotation. In three dimensions there are 
three parameters. In four dimensions six parameters are required to spec- 
ify a rotation. The correspondence between the parameters which describe 
a rotation and the dimension of the space is neither simple nor intuitive. 
Only in three dimensions is the number of rotation parameters equal to 
the dimension of the space. In four dimensions the number of parame- 
ters is double the number of those in three dimensions. The fact that the 
number of parameters in four dimensions is twice those in three dimen- 
sions is a manifestation of a remarkable correspondence between three- and 
four-dimensional space we will soon encounter. 

The invariant of a rotation in n-dimensional space, 

a 2 +6 2 + --- + £n 2 =£ 2 , 

is also the equation of an (n — l)-splrere embedded in n-dimensional space. 
Spheres and rotations are therefore intimately related. The symmetry of the 
sphere is a static, geometric thing: it is the locus of all points equidistant 
(in the Euclidean sense) from an origin. The symmetry of rotations is a 
dynamic that lies upon a sphere. The rotation £' = Rf sends the point 
f over the surface of a sphere of radius |£| as the parameters within R 
are varied. A ray connecting the origin at the center of the sphere to the 
moving point performs rotations, its tip constrained to the surface of this 
sphere. 




Figure j-3. Projection of a Two-Sphere onto the Two-Dimensional Plane. A two-sphere 
(a) projected onto the two-dimensional plane reveals a pair of superimposed disks (b), 
each disk filled or foliated with a family of circles. 



The most elementary rotation is that in two dimensions. The geometric 
object which is the basis of rotational symmetry in the plane is a one-sphere: 
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the circle. Consistent with the existence of one rotation parameter in two 
dimensions, points lying on a circle in two-dimensional space are described 
by one coordinate (such as the distance along its circumference or an angle 
of rotation); hence the name one-sphere. 

The basis of rotational symmetry in three dimensions is the familiar two- 
splrere (basketball, balloon, map-globe). It has two degrees of freedom to 
locate a point upon it. In four dimensions the three-sphere is the basis of 
rotational symmetry. All spheres exhibit rotational symmetry in that when 
points on the surface are subjected to a rotation, the sphere does not change 
form. The angles specifying the location of a point on the surface of the 
sphere change; but the radius — the invariant of rotational symmetry — does 
not. 

Human beings can visualize one-splreres (circles) and two-spheres (bal- 
loons) embedded in three-dimensional Euclidean space; but we have dif- 
ficulty visualizing higher-dimensional spheres. As an aid to visualization 
and insight into the underlying connections between rotations in spaces of 
various dimension, one can examine the way in which higher-dimensional 
spheres project into more familiar spaces like the two-dimensional plane 
and three-dimensional Euclidean space. 

The two-sphere projects onto the two-dimensional plane as a pair of 
superimposed disks as shown in Fig. 4-3. Imagine a semi-transparent map- 
globe whose image falls on the plane. Each of the hemispheres of the 
globes projects as a disk. The superimposed disks may be thought of as 
the two sides of an infinitely thin coin, each side a projection of one of the 
hemispheres. The two-sphere may be defined by 

£i 2 + ?2 2 + ?3 2 = const 

and the projection achieved by setting the coordinate perpendicular to the 
plane, say £ 3 , constant. This leads to the family of circles £i 2 + £ 2 2 = const, 
each circle a projection from the two-splrere for each value of £3. Each disk 
is built up with circles (it is said to be foliated by circles). There are 
two families of circles because it takes two disks to completely cover both 
hemispheres of the two-sphere. One disk covers the region £3 < 0, the other 
covers £3 > 0 and the circular boundary (great circle) joining both covers 
£3 = 0. Although the projective points are infinitely dense within one 
another, points on different disks are disconnected by direct paths. They 
can only be connected by a path which extends to and crosses over the 
circular boundary joining them at the edges of the disks. 

The three-sphere is the basis of rotational symmetry in four dimensions. 
It projects into three-dimensional Euclidean space as two three-dimensional 
spherical balls superimposed within one another as shown in Fig. 4-4. Each 
of these spherical balls is foliated with a family of nested two-splreres. If 
the three-sphere is defined by 

Ci 2 + ?2 2 + ?3 2 + £4 2 = const, 
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and the projection is along the £4 axis, one of the balls covers the hemisphere 
£4 < 0, the other covers £4 > 0, and the bounding two-sphere (“great two- 
sphere”) which connects both covers £4 = 0. There are two families because 
it takes two spherical balls filled with two-spheres to completely cover the 
three-sphere much like it takes two disks filled with circles to completely 
cover the two-sphere. The points lying within the projection are again 
“doubly infinite” because they are contributed by two distinct regions of 
the three-sphere just as points on the disks were contributed by two different 
hemispheres of the two-sphere. Points within different spherical balls, like 
those of the disks, are disconnected by direct paths. Points within one 
ball are linked to points within the other by paths which cross the great 
two-sphere joining both. 




(a) 



(b) 



Figure 4~4- Projection of a Three-Sphere into Three-Dimensional Euclidean Space. The 
projection of a three-sphere (a) into three-dimensional Euclidean space reveals a pair of 
interpenetrating spherical balls (b), each ball foliated by a family of two-spheres. 

This property of four-dimensional space in which the three-sphere un- 
derlying its rotational symmetry may be covered by two families of two- 
spheres is more than an aid to visualization. It is exceptional. One might 
suppose that all higher-dimensional spheres could be projected into foli- 
ations of lower-dimensional spheres; but this turns out to be impossible 
except for the case of four dimensions. Four-dimensional space is unique; 
and its topological and geometric properties are quite surprising.* The re- 
markable family of relationships between one-, two-, and three-dimensional 
spheres was discovered by the topologist Heinz Hopf in the first half of this 
century. These properties, which Hopf established as pure mathematical 
entities, turn out to be the fundamental topological properties of Kepler 
motion. The beautiful mapping of the three-sphere into a pair of families 
of two-spheres may be found in Note 6. 

For our purposes it is sufficient to recognize that the unique properties 
of four-space allow a three-sphere embedded in it to be covered by a pair 
of families of nested two-spheres. Each family of two-spheres has three- 



* See Ian Stewart, The Problems of Mathematics, Oxford: Oxford University Press, 
Chaps. 9 and 17, 1987. 
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Figure Rotation in Two-Space. A single piece of information such as the angle 

Q through which the two coordinate axes are turned specifies a rotation in the two- 
dimensional plane. 

dimensional rotations as a symmetry. We therefore arrive at a defining mo- 
ment for the Kepler symmetries: a four-dimensional rotation may be repre- 
sented as a pair of three-dimensional rotations. A pair of three-dimensional 
rotational symmetry algebras like that of the angular momentum (3.18) 
therefore describes the symmetries of two-body motion with inverse-square 
forces. Let us see how this happens. 

In two dimensions with coordinates £i, £2 the rotation lies in the only 
two-dimensional subspace possible, the plane (£i,£ 2)- A rotation in two 
dimensions possesses one free parameter, such as the angle O through which 
the coordinates are rotated in Fig. 4-5. 

In three dimensions with coordinates £1, £2, £3, three independent planes 
are defined by the pairs of components (£2, £3), (£3, £1), (>£i , <£2) - These 
planes correspond to the three independent parameters required to describe 
a three-dimensional rotation. It turns out that any rotation in three-space 
may be realized as a sequence of three two-dimensional rotations, one in 
each of these planes. In each of these elementary rotations, one of the 
coordinates is held fixed while the other two are rotated. Each of these 
two-dimensional rotations is specified by a rotation angle which fixes the 
rotation in each plane as shown in Fig. 4-6. 

Three pieces of information are naturally carried by a three-vector. A 
rotation in three dimensions may therefore be described by a three-vector 
such as 17 = (fii, CI2, H3) as was done in Chapter 3. Each symmetry param- 
eter has a corresponding invariant; and the invariant which corresponds to 
fl is the angular momentum M satisfying the cyclic Poisson bracket (3.18) 
of rotational symmetry.* 

Now leap to four dimensions. Six parameters are required to specify a 
rotation in four-space. They match the six independent planes that exist 
in four-space: the three planes of three-space plus three new planes (£1,^4), 
(£25 £4)1 (£3, £4)5 formed by the fourth dimension paired one at a time with 
the three other dimensions as illustrated in Fig. 4-7. The six planes are the 
property of the pair of families of two-splreres with which the three-sphere 



* The rotation matrix in three dimensions may be explicitly exhibited through the 
alternating tensor as R ^ ; but its form in four and higher dimensions is 

more complicated. 
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Figure 1^-6. Rotation in Three-Space. A rotation in three dimensions may be realized as 
three successive two-dimensional rotations, one in each of the three independent planes 
that exist in three-space. Each of these two-dimensional rotations is specified by a 
rotation angle. The three angles naturally form a three-vector, f2 = (f^i, £^ 2 ,^ 3 ). 




Figure J^-l. Rotation in Four-Space. Six independent planes exist in four-space. A 
rotation in four-space may be realized as six successive two-dimensional rotations, one 
in each of these planes. The six angles specifying the rotations form two three- vectors, 
fl = (fii, ^ 2 , ^ 3 ) and 4> = (0i, 02, 03)- 

of rotational symmetry in four-space is foliated. A rotation in four-space 
may therefore be realized as six successive two-dimensional rotations, one 
in each of the six planes as shown in Fig. 4-7. 
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Six pieces of information are required to describe a rotation in four- 
space. Two three- vectors carry six pieces of information and are the natural 
vehicles to describe a four-dimensional rotation. One of these is again 
the vector 17. It describes the rotations in the planes (£ 2 , £ 3 ), (^ 3 , ^ 1 ), 
(^ 1 ,^ 2 ) and its invariant is the angular momentum M. The other, which 
we symbolize as <p = {<j>i,<j> 2 ,<f> 3 ), describes the rotations in the planes 
(?i 1 ^ 4)5 (£ 2 , £ 4 ), (£ 3 , £ 4 ) as indicated in Fig. 4-7. Its invariant turns out to 
be the eccentricity e. 

In summary, a single set of three-dimensional rotations possesses a sin- 
gle invariant three-vector. Two sets of three-dimensional rotations possess 
two three-vector invariants; and such a pair represents* four-dimensional 
rotations. The symmetry parameter for one set of three-dimensional rota- 
tions is 17 and has associated with it the invariant M. The additional set 
of rotations described by the symmetry parameter <p possesses the addi- 
tional invariant e. These vectors cannot be arbitrary three-vectors. They 
must possess rotational character. One must show they satisfy a rotational 
Poisson bracket algebra. 

Begin by observing that the quantity 

h= v / fcV/(-27f) 

which appears as the coefficient of M 2 in the first of Eqs. (4.8) has the 
dimensions of action or angular momentum. This quantity is a key actor in 
the classical motion of two bodies; among its many roles, we shall see it as 
the Poincare invariant. Because of its recurring importance, we shall call 
h the Kepler constant. It is intimately connected to an equally important 
quantity in the quantum motion. The Kepler constant h is the classical 
counterpart to the quantum Planck constant h. The Kepler constant is a 
constant of the motion. But the Planck constant is more — it is a constant 
of nature. We shall see the intimate connction between these two constants 
in Chapter 5. 

The eccentricity and the angular momentum invariants may be put on 
the same footing by using a slightly different dimensional form of the ec- 
centricity created through the Kepler constant: 

E = he. 

Since E is represented as a function of the invariants e and Ti (through h), 
it is similarly invariant and serves equally well in favor of e in the complete 
set of invariants for two-body, inverse-square force motion. 

Let us dare to call E the eccentrum. The eccentrum has the same dimen- 
sions as the angular momentum; indeed, it is the intimate connection be- 
tween E and M that underlies the symmetry. For bound motions (7 i < 0) 



* A representation of a symmetry is a set of vectors that satisfy its symmetry algebra. 
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the components of the eccentrum are real quantities (the components of the 
eccentricity e are always real) ; but for free motions (hi > 0) , the eccentrum 
components — and the Kepler constant h -are imaginary (Note 3). 

The Poisson bracket of the angular momentum M cyclically reproduces 
M itself. A straightforward calculation that is carried out in Note 3 shows 
that the Poisson bracket algebra for the pair of vectors M and E is 

[Mi.Mj] = e ijk M k , [Ei,Mj] = e ijk E k , [Ei,Ej] = e ijk M k . (4.10) 

From the algebra of the individual components one readily finds the follow- 
ing algebra of the magnitudes: 



[M 2 , Ei] = ~[E 2 ,E t ] = 2 e ijk EjM k 



and all scalars have vanishing Poisson brackets with the angular momen- 
tum, 

[M 2 , Mi] = 0, [ E 2 , Mi] = 0, [. M 2 , E 2 ] = 0. 

The algebra of the eccentrum in the third of Eqs. (4.10) is not the same 
as the paradigmatic algebra of three-dimensional rotations typified by the 
angular momentum in the first of Eqs. (4.10). The algebra of E, rather 
than cyclically reproducing E itself in the manner of angular momentum, 
reproduces Ml The reason for this twist is that the invariants M and E 
are not themselves the two invariants belonging to the pair of spheres into 
which four-dimensional rotations project, each of which is a rotationally 
symmetric vector with algebra like that of the angular momentum. Rather, 
they are sums and differences of these invariants as we shall soon see. The 
Poisson brackets of E are not confined to E but instead incorporate M. 
The elements of the symmetry algebra are therefore six in number: Mi, 
M 2 , M 3 , Ei, E 2 , E$, and correspond to the six components of the two 
three-vectors required to represent a four-dimensional rotation. 

The three-dimensional rotational symmetry underlying the invariant M 
is a fundamental symmetry of mechanics, no matter the force law; and the 
symmetry algebra reflects this. It can be seen that the algebra for M alone 
involves only its three components as indicated by the first of Eqs. (4.10). 
The algebra of E mixes in M as shown by the third of Eqs. (4.10), but that 
of M does not mix in E. This means that three-dimensional rotations (rep- 
resented by M alone) form a subgroup of the full group of four-dimensional 
rotations (represented by both M and E). Three-dimensional rotational 
symmetry persists in the presence of the wider symmetry in four dimen- 
sions induced by the inverse-square nature of the force. This symmetry 
is described by the angular momentum M and possesses a single Casimir 
invariant, the magnitude M. (Recall that a Casimir invariant is a quantity 
that has a vanishing Poisson bracket with all elements of the symmetry 
algebra) . 




4- Classical Mechanics: The Heavens 



99 



Since four-dimensional rotations are described by two three- vectors, they 
possess two Casimir invariants. It is not difficult to show from the first of 
the equations following Eqs. (4.10) that the Casimir invariants of four- 
dimensional rotations described by M and E are M 2 + E 2 and M E. In 
this light the symmetry conditions (4.8) natural to the eccentricity invariant 
come into their own. When e is expressed as e = E/h, they take the simple 
form 

M 2 + E 2 = h 2 , M E = 0. (4.11) 

The symmetry conditions of the two-body problem for gravity and electric- 
ity turn out to be conditions on the Casimir invariants of four-dimensional 
rotations! 

The four-dimensional rotational symmetry may be made fully transpar- 
ent as two sets of three-dimensional rotations appropriate to a pair of two- 
splreres by using the invariant vectors of these spheres out of which M and 
E are constructed. Let S and D be the invariants of these two sets of 
three-dimensional rotations. The vectors M and E turn out to be sums 
and differences of S and D: 



M=S + D, E = S -D, 

with the inverse relationships 

S = (M + E) /2, D = (M-E)/ 2. 

It is an easy matter to show by substituting for M and E in terms of S 
and D in Eqs. (4.10) that S and D are each representations of purely three- 
dimensional rotations just like the angular momentum, each reproducing 
only itself according to the cyclic paradigm of rotational symmetry: 

[Si,Sj] = ei jk S k , [Si,Dj]= 0, [D i ,D j ] = e ijk D k . (4.12) 

The vectors S and D each represent rotations in three dimensions. Each 
corresponds to one of the families of two-splreres into which the three-sphere 
of four-dimensional rotational symmetry projects. Each vector accordingly 
has the identical algebra of rotations in three dimensions and each vector’s 
algebra is independent of the other’s. 

One could therefore have begun with two copies of three-dimensional 
rotational symmetry corresponding to a pair of two-spheres, the vectors S 
and D. Putting this pair of three-dimensional representations together as 
the sums and differences M = S + D and E = S — D, one would then have 
found that M and E obey the Poisson bracket algebra (4.10) and turn out 
to be the angular momentum and eccentrum of Kepler motion. 

Although M and E are more familiar, the vectors S and D are an equally 
valid set of invariants of the symmetry equivalent to M and E in the sense 
that if one specifies the vectors M and E, the vectors S and D are uniquely 
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specified. The Casimir invariants E 2 + M 2 and M E in the two symmetry 
conditions (4.11) become in terms of S and D: 

S 2 + D 2 = h 2 /2, S 2 — D 2 = 0, (4.13) 

from which it follows that S and D are the same Casimir invariant and this 
Casimir invariant, aside from a factor of one-half, is none other than the 
Kepler constant: 

S = D = h/2. 

The radii of the spheres of four-dimensional rotational symmetry are Kepler 
constants and depend only upon the energy. The spheres have real radii for 
bound motions (hi < 0) and imaginary radii for free motions (hi > 0). The 
phase space of the invariants for free motions is therefore non-Euclidean 

(Note 3). 



The Two Faces of Rotational Symmetry 

Four-dimensional rotations are represented by a pair of three-vectors; 
but there exist two alternative representations of these three-vectors, one 
in M and E , the other in S and D. The representations are distinct be- 
cause the Poisson bracket algebras (4.10) and (4.12) for each representation 
are different; yet the states of motion corresponding to these different rep- 
resentations are the same because given M and E, the vectors S and D 
are uniquely fixed. 

Why are there two different representations of the motion of two bodies 
with inverse-square forces? They are the manifestation of a fundamental 
feature of three-dimensional rotations: three-dimensional rotations come in 
two mathematical forms, one based on three-vectors with real components, 
the other based on two- vectors with complex components known as spinors. 
The four-dimensional rotational symmetry of the motion is equivalent to a 
pair of three-dimensional rotations; and these three-dimensional rotations 
have both a vector and a spinor face. 

The significance of this duality for the two-body problem is that both 
the classical motion of planetary bodies and the quantum motion of the 
hydrogen atom have two distinct faces: one vector, the other spinor. Each of 
these faces has a corresponding geometric form. The vector face is exhibited 
in spheres and cones, the spinor face in paraboloids. The representations 
in S and D on the one hand and M and E on the other correspond to 
these two faces. We shall explore these two faces of rotational symmetry 
algebraically. Then we shall show their different geometrical forms. 

The familiar face of rotational symmetry is the vector face as exhibited 
by a three- vector such as the angular momentum, 
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which satisfies the Euclidean metric 

Mi 2 + M 2 2 + M 3 2 = M 2 . 



Whereas a vector consists of real elements, a spinor consists of complex 
elements. A spinor is built from complex two-vectors, such as 



V = 




(4.14) 



Since the two- vector is complex, four unique quantities may be formed from 
products of its two components rji , r /2 and their conjugates 17 *, 1)2 ■ Let us 
call these four quantities M + , M ~ , M + , M_. These four quantities create 
a two-matrix; and this two-matrix turns out to be the spinor (complex 
two-matrix) equivalent of a three- vector: 



f M+ 


M + \ 


| _ ( ? 7i?7i 


V 1 V 2 \ 


\M_ 


M ) 


\V 2 V 1 


V 2 V 2 ) 



(4.15) 



One can easily see that this spinor possesses a vanishing determinant, a 
fundamental property of all two-matrices built from a single two- vector: 

M+M~ - M+M_ = 0. (4.16) 



The spinor equivalent of a three-vector has a quadratic form which is an 
invariant vanishing identically. Equation (4.16) is the “spinor metric” con- 
dition which creates an invariant spinor magnitude (it is always zero) just 
as the Euclidean metric creates an invariant vector magnitude. 

The coincidence between the vector and the spinor is therefore this: a 
collection of four real things obeying the Euclidean metric is equivalent to a 
collection of two complex things obeying the spinor metric (the four spinor 
components M + , M ~ , M + , M_ are built from a single two- vector with two 
complex components): 



M = 





M+\ 
M~ ) ' 



It may seem incongruous that there are four spinor components to represent 
a three-vector; but the magnitude of the vector is encoded in the spinor 
along with its three components. The four real things in the case of the 
vector are its three components Mi, M2, M3, and its magnitude M. 

One therefore has two sets of four quantities: (M, Mi, M2, M 3 ) for the 
vector and (M + , M~ , M + , M_) for the spinor, each of which is a represen- 
tation of the same physical quantity. The relationships between the two 
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forms were first worked out by Wolfgang Pauli in the nineteen twenties. 

The spinor components are given in terms of the vector components by 

M + = M + M 3 , M + = Mi + iM 2 , 

M~ = M-M 3 , M_ = Mi — iM 2 ■ 



(note that M + and M_ are complex conjugates). The inverse relationships 
of the vector components to the spinor components are 

M = (M+ + M-)/ 2, Mi = (M+ + M_)/2, 

M 3 = (M+ - M~)/2, M 2 = (M+ - M_)/2z. 

Equations (4.17) and (4.18) reveal that the spinor neatly divides into two 
parts, the part contained on the left-hand sides of Eqs. (4.17) and (4.18) 
pairs ( M, M 3 ) with (M + , M~) while that on the right-hand sides pairs 
(Mi, M 2 ) with (M + ,M_). These two parts are “hinged” by the common 
product M + M~ = M + M_ which also shows how the spinor metric and 
Euclidean metric imply one another. The first product is M + M~ = M 2 — 
M 3 2 and the second is M + M_ = Mi 2 +M 2 2 . These two products, according 
to the spinor metric (4.16), are the same. They yield the vector metric 
condition 



M+M~ = M+M_ = M 2 - M 3 2 = Mi 2 + M 2 2 . (4.19) 

Each of the spinor parts has significance in its own right. Readers have 
perhaps already taken note of the fact that (M, M 3 ) which appears in one 
of the pairings constitutes the Casimir set of the vector. This means one 
part of the spinor is naturally the Casimir set. The spinor representation 
of the Casimir set of rotational symmetry is (M + ,M _ ). 

Consider now the part involving (Mi,M 2 ) and (M + ,M_). Unlike M + 
and M~ which have a vanishing Poisson bracket, the quantities M + and 
M_ have the same non-vanishing Poisson bracket as Mi and M 2 save for a 
factor: 

[M+,M_] = -2i[Mi,M 2 ] = — 2 zM 3 . 

Although M + and M_ do not have a vanishing Poisson bracket, their 
bracket bears a decisive relationship to the component M 3 . The Poisson 
bracket algebra of M + and M_ produces M 3 while their brackets with M 3 
result in self- rather than cyclic-reproduction: 

[M±,M 3 ] = ±iM±. 

While the elements of a rotationally symmetric vector like M cyclically 
reproduce one another, the spinors (M + , M_) constructed from them repro- 
duce only themselves. Self-reproduction gives the spinor pair (M + , M_) the 
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power to generate the values of a given component of the angular momen- 
tum such as M 3 . This is because when one forms Poisson bracket products 
of M± and M3, the algebra remains confined to M±. Increasing (or de- 
creasing) quantities of M3 are the outcomes. For this reason M + and M_ 
are called creation and annihilation operators: they create and annihilate 
values of M3. This property is exploited in quantum motion where M + 
and M_ take the central role in the quantum description of rotational sym- 
metry. They generate all the values of the angular momentum which are 
possible in quantum mechanics. We shall see the creation and annihilation 
operators M + and M_ in action in Chapter 5 . 

Each pair of spinor components of a rotationally symmetric vector car- 
ries an additional significance beyond that of simply keeping book on the 
components and magnitude of the vector. The spinor pair (M + ,M _ ) is 
the Casimir set of this vector while the pair (M+, M_) contains its creation 
and annihilation operators. 

The two representations of rotational symmetry have two different but 
closely related geometrical forms. To see this, turn to the fundamental con- 
dition that all vectors in Euclidean space obey: M + M 2 2 + M 3 2 = M 2 . 
This is a second-order algebraic expression, the general family of which 
describes spheres, cones, paraboloids, and hyperboloids. The vector spec- 
ification of the Casimir set (M, M3) describes spheres M = const and 
cones M3/M = const. The quantities M and M3 are therefore said to 
be spherical-polar. The angular momentum vector must lie on the cone 
of half-angle tan~ 1 (M 3 /M) centered on the M3 axis. The spherical-polar 
surfaces are shown in Fig. 4-8 (a). 




Figure f-8. The Two Faces of Rotational Symmetry. In (a) the Casimir set has the 
vector specification (M, Ms) and the surfaces which correspond to constant values of M 
are spheres of radius M while those which correspond to constant values of Ms/M are 
cones. In (b) the Casimir set has the spinor representation (M+,M — ). The surfaces 
which correspond to constant values of M - *" and M~ are downward and upward opening 
paraboloids of revolution with foci at the origin. 



To see the spinor face of rotational symmetry, use M = M + — M3 to 
eliminate M in favor of M + in the metric condition Mi 2 + M 2 2 + M3 2 = M 2 
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with the result 

Mi 2 + M 2 2 + 2 M+M 3 = (M+) 2 . 

This is the equation of a paraboloid. The surface generated by M + = const 
is a paraboloid of revolution with focus at the origin and which opens 
downward in the negative M3 direction. In like fashion the metric condition 
may be re-written in terms of M _ rather than M as 

Mi 2 + M 2 2 - 2 M~M 3 = (M~) 2 . 

The surface generated by M~ = const is a paraboloid of revolution with 
focus at the origin and which opens upward in the positive M3 direction. 
These surfaces are shown in Fig. 4-8 (b). The spinor representation M + 
and M~ is therefore said to be parabolic. 

Let us summarize the duality of rotational symmetry. Two representa- 
tions of a rotationally symmetric quantity like the angular momentum are 
possible: one vector, the other spinor. Corresponding to each representa- 
tion is a family of surfaces appropriate to the representation. The vector 
representation manifests itself in spherical-polar geometry while the spinor 
representation is parabolic. In the spherical-polar expression the Casimir 
set (M, M3) is specified and the components are given in terms of it and 
the free angle tan <f> = M 2 /Mi as 

Mi = \JhP - M 3 2 cos </>, 

M 2 = yj M 2 — M3 2 sin (f>, ( 42 °) 

A/3 — A / _3 . 



This system describes spheres (M = const) and cones (M 3 /M = const). 
The total magnitude of the components Mi, M 2 is specified by the Casimir 
set (M, M 3 ); but their angular position given by tan cj> = M 2 /Mi is not. It is 
the free member of the three independent quantities (Mi, M 2 , M 3 ) or (M, 
M 3 , f>) or (M + , M~, cf>) required to specify a vector in three-dimensional 
space. 

In the parabolic expression the Casimir set has the spinor representation 
(M + ,M~) and the vector components are given in terms of it by 

Mi = VM+M~ cos 4>, 

M 2 = Vm+M- sin (j>, (4.21) 

M 3 = (M+ -M~)/2. 

This system describes downward (M + = const ) and upward ( M~ = const) 
opening paraboloids of revolution. 




4- Classical Mechanics: The Heavens 



105 



Invariant Sets of Kepler Motio7i 

Three-dimensional rotations are described by the pair of Casimir sets 
{Tl, M, M3), {Tl, M, Mi). What are the Casimir sets for four-dimensional 
rotations? Since three-dimensional rotations are a subgroup of the full 
symmetry, four-dimensional rotations also possess the sets (Tl, M, M 3 ) f 
{Tl, M, Mi). By studying all the combinations of Casimir invariants and 
components in the light of Eqs. ( 4 . 12 ), one finds the four quantities {S, 
S3, D, D 3 ) are the only others with collectively vanishing Poisson brackets. 
But since the symmetry conditions ( 4 . 13 ) require S = D = h/ 2 , only three 
are independent. The inverse-square force symmetry therefore contributes 
one additional Casimir set which, since h, S , and D are all fixed by the 
energy TL , may be expressed as {H, S3, D 3 ). 

Because there are two representations of four-dimensional rotations, one 
in M and E, the other in S and D, the additional Casimir set of four- 
dimensional rotations may also be expressed in terms of the angular mo- 
mentum and eccentrum components M3 and £3 rather than S3 and D3 as 
(H, E3, M3). There are therefore two representations of the third Casimir 
set of four-dimensional rotations: (H,S3,D 3 ) and (Tl, £3, M 3 ). These two 
alternatives are the sign of the underlying dual representation of rotations 
in either vector or spinor form because four-dimensional rotations consist 
of a pair of three-dimensional rotations. 

Both vector and spinor representations therefore occur in four dimen- 
sions as well as three. The Casimir sets {Tl, M, M3) and {Tl, M, Mi) are 
clearly vector representations; but let us look more carefully at the set 
{Ti,S3,D 3 ) and its alternative {Tl, £3, M 3 ). The set (H,S3,D 3 ) is a vec- 
tor representation combining the two rotationally symmetric vector sets 
{S, S3) and {D, D 3 ) (recall that specifying Tl is equivalent to specifying S 
and D). On the other hand, the set {Tl, £3, M3) turns out to be the spinor 
representation corresponding to {Tl, S3, D 3 ). 

The spinor nature of (H, E3, M 3 ) can be seen by observing that the 
spinor representations of S and D are 

S ± = S±S 3 , D ± = D±D 3 . 

Since S = D = h/ 2 , S3 = {M3 + E3)/ 2 , and £3 = (M3 — £3)/ 2 , the spinors 
S ± and D ± can be written 

S ± = h/ 2 ±{M 3 + E 3 )/ 2 , D ± = h /2 ± (M 3 - £ 3 )/ 2 . ( 4 . 22 ) 

Eliminating h, one finds M 3 and £3 are completely expressible in the spinor 
components S ± and £ . 

M3 = ±(S' ± — £ T ), £3 = ±(S' ± — D ± ). 

A representation of the motion in the invariants {Tl, £3, M3) is thus a spinor 
representation incorporating both the spinor components S ± and £ ± . Each 
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of these spinors again corresponds to one of the two-spheres into which 
the three-sphere of four-dimensional rotational symmetry projects. The 
invariants M3 and E 3 are a composite of these two spinors, one from each 
two-splrere. 

To sum up, the third Casimir set of Kepler motion is a vector represen- 
tation when expressed in terms of the rotational invariants of the individual 
two-spheres, (ft, S3, D 3 ). This Casimir set has an alternative representa- 
tion in terms of the components of the eccentrum and angular momentum, 
(ft, £3, M3) which are sums and differences of the individual two-sphere 
invariants. In this form it is a spinor representation. When expressed in 
terms of these spinor invariants, the motion will be viewed in parabolic 
coordinates. 

Three independent Casimir sets of all the symmetries of the relative 
motion such as 



(fi,M,M 3 ), (fi,M,Mi), (fi,E 3 ,M 3 ) 

are capital quantities. The number of single- valued invariants in each 
Casimir set is three. This means the three-dimensional relative motion 
is completely integrable. There are three Casimir sets. This means the mo- 
tion is fully degenerate. The bound flow lies upon a one-dimensional torus 
and its trajectories are closed. These Casimir sets embody all the symme- 
tries of the relative motion of two bodies with gravitational or electrical 
forces. They play the organizing role in both the classical and quantum 
motion. They are the invariants with which symmetry turns complexity 
into simplicity. 



The Vector View of Celestial Mechanics 

The Casimir sets (ft, M, M3) and (ft, M, Mi) are both vector expressions 
of rotational symmetry. They give rise to the same motions, but with 
different labels since all one needs to do is interchange Mi and M 3 to 
obtain one motion from the other. It is sufficient therefore to consider only 
one of them, say (ft, M, M3). 

The symmetric coordinates of the vector specification (ft, M, M 3 ) are 
spherical-polar. The position vector in the center of mass frame of reference 
is r = (x\,X2,x 3 ). (The symbols x\, X2, x 3 are now to be understood as 
the coordinates in the center of mass frame of reference.) In spherical-polar 
coordinates (q± = r, q 2 = <j>, q 3 = 9) shown in Fig. 4-9, a point in space is 
specified by the length r of the position vector and two angular coordinates, 
a polar angle <f> which fixes the rotation of the position vector about the x 3 
or polar axis and a latitudinal angle 9 which fixes a cone (centered about 
the x 3 axis) upon which the position vector must lie. (One requires a single 
axis for designation as the “polar’ axis. The x 3 direction is arbitrarily 
selected.) 
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The coordinate grid of the system is formed by the intersections of 
spheres (r = const), planes (<f> = const), and cones (9 = const). The 
motion of two-bodies bound by central forces is naturally matched to the 
symmetries of these surfaces; and the Hamilton- Jacobi equation will sepa- 
rate in their coordinates. 



X3 




Figure 4~9 ■ spherical-polar Coordinates. A point in space is specified by the length 
of the position vector r and two angular coordinates: a polar angle <fi which fixes the 
rotation of the position vector about the xz axis and a latitudinal angle d which fixes 
a cone (centered about the xz axis) upon which the position vector must lie. The 
coordinate grid of this system is formed by the intersections of spheres (r = const), 
planes ( <p = const), and cones (6 = const). 



The integrals for the central force symmetry may be determined by sep- 
arating the Hamilton- Jacobi equation in spherical-polar coordinates. The 
spherical-polar coordinates r, </>, 9 are related to the rectangular coordi- 
nates x\, X2, Xz by 

Xi = r sin 9 cos <j>, X 2 = r sin 9 sin c\>, xz = r cos 9. (4.23) 

The corresponding velocities are 

x\ = sin 9 cos <f)f — sin 9 sin + cos 9 cos <f>r9, 

X 2 = sin 9 sin <f>r + sin 9 cos + cos 9 sin cj>r6, (4.24) 

xz = cos 9r — sin 9r9. 

The Lagrangian is L = T — V = fixf / 2 — V, or using Eq. (4.24), 



L = n(f 2 + r 2 sin 2 9<j) 2 + r 2 9 2 ) /2 — V. 



(4.25) 
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The three momentum components are p r = dL/dr, p $ = dL/d(j>, pg = 
d L/dO, or 



p r = pr, Ptf, = pr 2 sin 2 0<j), pg = pr 2 9. (4-26) 

It should be noted that the momenta in these coordinates have different 
dimensions. The momenta p $ and pg have the same dimensions as the 
angular momentum. 

The angular momentum takes a natural form in spherical-polar coordi- 
nates. The polar angle coordinate f> describes rotations about the X 3 or 
polar axis. Since the angular momentum describes the rotational aspects 
of motion, the polar angular momentum component M 3 must bear a direct 
relationship to the angular coordinate </>. A calculation using the relation- 
ships (4.23) and (4.26) shows that the polar angular momentum component 
M 3 is identical to the linear momentum p $ conjugate to <j>: 

M 3 = p(xi ±2 — X 2 X 1 ) = pr 2 sin 2 Of = p^. (4.27) 

By similar calculation, the total angular momentum in spherical-polar co- 
ordinates has the form 



M 2 = p 2 r 4 (sin 2 9f 2 + 0 2 ) . 

Using the second and third of Eqs. (4.26), the total angular momentum can 
also be expressed as 

M 2 = M 3 2 / sin 2 9 + p 2 g. (4.28) 

The Hamiltonian Tt = Pifi — L follows from the Lagrangian (4.25). The 
Hamiltonian and Hamilton-Jacobi equation for TL = 8 are then 

H = pl/2p + pl/2pr 2 sin 2 6 + p 2 g/2pr 2 + V(r) = £. (4.29) 

Since each momentum component is a function of only its conjugate position 
coordinate, the potential and kinetic energy terms corresponding to each 
coordinate may be grouped together and the Hamilton-Jacobi equation 
separated. The potential energy must be of such a form that this is possible. 
For a central force potential V = V(r) separation is indeed possible and 
the Hamilton-Jacobi equation (4.29) may be partitioned as 

r 2 sin 2 9 [p 2 r /2p + pg/2pr 2 + V (r) - £\ + p\/2p = 0. (4.30) 

The Hamilton-Jacobi equation now splits into three equations. The first 
term on the left of Eq. (4.30) is a pure function of r and 9 while the second 
is a pure function of </>. Since they must sum to zero for arbitrary r, 9 , and 
<j>, this can only occur if both terms are the same constant with opposite 




4- Classical Mechanics: The Heavens 



109 



sign. The Hamilton- Jacobi equation for a central force thus reaffirms that 
Prj, is the invariant angular momentum component p ^ = M 3 . 

The invariance of the angular momentum leads directly to the second of 
Kepler’s laws: a planetary body sweeps out equal areas of its orbit in equal 
times. The differential of a wedge-shaped area of the plane formed by the 
angle df> is 

dA = r 2 d(j)/2 = r 2 <pdt/ 2. 

The polar momentum component is given by p$ = M 3 = pr 2 (f>. Eliminate 
r 2 4> in the area differential in favor of the angular momentum and obtain 
the desired proportionality: 



dA = (M 3 / 2p) dt. 

One learns more than that the area swept out is proportional to time. One 
also learns that the factor of proportionality is the ratio of the angular 
momentum to the reduced mass. Kepler’s second law rests strictly on the 
fact that the force is central. It is not restricted to inverse-square forces. It 
holds for arbitrary central force fields. 

Equation (4.30) may be further separated by re-expressing it in terms of 
Ptj, = M 3 and rearranging as 

r 2 [pl/2p + V(r) - £\ + [M 3 2 /2/zsin 2 9 +p 2 0 /2p\ =0. (4-31) 

The two terms on the left of Eq. (4.31) are pure functions of r and 9 , 
respectively; hence each of them must be a constant which, from Eq. (4.28), 
may be recognized as the total angular momentum M 2 . Two final separated 
equations thus split from the original Hamilton-Jacobi equation: 

p 2 g + M 2 / sin 2 9 = M 2 (4.32) 

and 

p 2 + 2p \V(r) — £] = —M 2 /r 2 . (4.33) 

The Hamilton-Jacobi equation splits into three equations and the three 
elements of the Casimir set turn out to be the separation constants of these 
three equations. 

The action in splrerical-polar coordinates may be expressed as 

S(r, 9, </>, t) = -£t + S r (r) + S^{4>) + S e {9). 

The complete solution for the action is now reduced to quadratures of the 
separated Hamilton-Jacobi equations. For the partial action dS^/dcj) = 
p,), = M 3 , the integral is 



{(f) = j p r j > dcj) = M 3 (f>. 



(4.34) 
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The partial action Sg(9) = J pg d6 is formed with pg taken from Eq. (4.32): 



Sg(9) = J \J M 2 - M 3 2 / sin 2 6 dO. (4.35) 

The radial momentum p r may be isolated from Eq. (4.33) for the radial 
action S r (r) = f p r dr: 

S r (r) = J s/2p [£ - V(r) - M 2 / 2 /zr 2 ] dr. (4.36) 

The actions S r (r), S </,(</>), Sg(9) constitute the solutions of the Hamilton- 
Jacobi equation. These solutions are expressed in terms of constants of 
integration appropriate to the vector representation of rotational symmetry, 
/ = ( Tt, M, M 3 ) where Tt = £. 

The actions S^((j)) and Sg(9) are fully determined; but the action S r (r) 
is not determined until the potential V(r) is specified. This is because 
the separation is a result of the rotational symmetry of the central force. 
An infinite family of generally nonisolating actions and orbits satisfy this 
symmetry. They do not yield a closed trajectory. The motion is separable; 
but the integrals are not fully isolating. 

For the particular central force potential V = — k/r of gravity and elec- 
tricity, the action S r (r) becomes fully determined as the quadrature 

S r (r) = J sj2p [E + k/r - M 2 /2 pr 2 } dr (4.37) 

and this particular action integral is isolating (see Note 2). 

Symmetry permits separation of the Hamilton-Jacobi equation into pure 
functions of the separate coordinates which may be integrated by quadra- 
ture. The separation constants which appear in the integrals turn out 
to be the Casimir set of the symmetry. Since spherical-polar coordinates 
match the rotational symmetry of central forces whose corresponding invari- 
ant is the angular momentum, the separation constants are appropriately 
(Tt, M, M3). The integrals for S r (r), Sg(9), and S^,(<)>) are achieved for any 
potential of the form V = V(r). 



The Spinor View of Celestial Mechanics 

The Casimir set (Tt, £ 3 , M 3 ) is the spinor expression of rotational sym- 
metry. This symmetry exists because of the inverse-square nature of the 
force and manifests itself in parabolic geometry. The parabolic coordinate 
system is shown in Fig. 4-10. The parabolic coordinates q± = £, Q 2 = 7 ?) 
<73 = <p are related to the rectangular coordinates x\, X 2 as 



&i = p cos 



X 2 = P sm 




4 - Classical Mechanics: The Heavens 



111 



where p is the projection of the position vector onto the plane x 3 = 0 given 

by 

p 2 = £,V = Xx + X2 2 . 

The radial coordinate r and polar-axis coordinate X3 are given by 
r={Z + r))/ 2 , x 3 = (£ - ??)/ 2 , 

with the corresponding relationships 

t = r + x 3 , 77 = r — x 3 . 

%3 




Figure 4.- 10 . Parabolic Coordinates. The polar coordinate <j> represents rotations about 
the X3 axis. Surfaces formed by <j> = const are planes. Surfaces formed by £ = const are 
paraboloids of revolution about the X3 axis whose focus is at the origin and which open 
downward in the negative X3 direction. Surfaces formed by 77 = const are paraboloids of 
revolution which open upward in the positive X3 direction. 



The polar coordinate <j> plays the same role in parabolic coordinates that 
it plays in spherical-polar coordinates describing rotations about the x 3 
or polar axis. Surfaces formed by £ = const are paraboloids of revolution 
about the x 3 axis whose focus is at the origin and which open downward 
in the negative x 3 direction. Surfaces formed by 77 = const are similar 
paraboloids of revolution about the x 3 axis with focus at the origin and 
which open upward in the positive x 3 direction. The coordinate grid con- 
sists of the intersections of downward and upward opening paraboloids of 
revolution £ = const, 77 = const, and planes <j> = const. 

Since the magnitude of the position vector in parabolic coordinates 
is r = (£ + rj) /2, the intersections of the two paraboloids of revolution 
£ = const, rj = const are circles upon which the magnitude r of the posi- 
tion vector is constant. The energy Tt, which is purely a function of r, is 
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therefore constant on these circles. Motion described by the Casimir set 
{hi. E 3 , M ?J ) is naturally matched to these coordinates and the Hamilton- 
Jacobi equation will separate in them. 

The velocities in parabolic coordinates are 

±1 = cos fp — sin 4>p4>, &2 = sin <f>p + cos (f>pq i>, x 3 = (£ — if}/ 2 , 

where 

P = P (C/2? + i]/2 77 ) . 

The Lagrangian L = pxf /2 — V is then 

L = p(f + V ) (f 2 /^ + r, 2 /4 V ) + p^/2 - V, 
and the momenta p £ = dL/df , p v = dL/dp , p $ = dL/df are 

PH = P(Z + V)£/^, Pv = + V)V/^V, P* = P&<i>- 

The Hamiltonian and Hamilton- Jacobi equation then take the form 

U = (&V 2 i L + PPv/ 2 P) + PfP\l 2 P + V = £. (4.38) 

To separate the Hamilton- Jacobi equation in parabolic coordinates first 
observe that since V = V(r) and r = (£ + rf)/2, the polar momentum p ^ 
contains the only f dependence; hence it is a constant, the polar angular 
momentum M 3 . 

An inverse power potential (corresponding to the inverse-square force) 
is a function only of f and p in V = — k/r = —2 k/(f + 77 ). This structure 
matches the structure of the kinetic energy in Eq. (4.38) and it is therefore 
possible to completely separate the Hamilton- Jacobi equation in parabolic 
coordinates. To do so, eliminate p $ in favor of the invariant M 3 , substitute 
V = —2k/(f + 77 ), and re-express Eq. (4.38) as 

Z(£-2pl/p-M 3 2 /2pe + k/Z) 

+ v{£- 2 P%/P ~ m 3 2 / 2prf + k/rf) = 0. 

Since the two lines of Eq. (4.39) are each pure functions of different co- 
ordinates which sum to zero, they must each be the same constant with 
opposite sign. Appropriately, the separation constant for this symmetry is 
e 3 , the polar component of the eccentricity. 

This connection may be shown by directly calculating e 3 in parabolic 
coordinates. The term p x M which appears in the eccentricity (4.7) may 
be expressed in terms of r and p through the identity pxM = rp 2 -p(r-p). 
The polar component of the eccentricity is then 



e3 = (X3 P 2 ~p 3 r-p) /kp - x-i/r. 



(4.40) 
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Using the parabolic coordinate relationships, find 
2 

P3 = ~ VPr,), r ■ p = &£ + Tip,,. (4.41) 

The total squared momentum may be taken from the kinetic energy in the 
first term of Eq. (4.38) as 

p 2 = ^ (£pf + + ^pI- (4.42) 

Put together the pieces from Eqs. (4.41) and (4.42) and the eccentricity 
(4.40) has the parabolic coordinate form 

63 = 1+^ ^ “ P ^ ,kp + ~ h+v' (4 ' 43) 

The momentum component p $ is the invariant M 3 . The eccentricity may 
be separated into pure functions of £ and rj by partitioning Eq. (4.43) in 
the manner 



[(1 - e 3 ) - 2 &*/kp - M 3 2 /2krt]/£ 

-[(l + e 3 )- 2 qp 2 /kn- M 3 2 / 2 kprj\/r]= 0 . 

The terms on the first line of Eq. (4.44) are a pure function of £ while those 
on the second line are a pure function of 77 . Each line must therefore be the 
same constant with opposite sign. This separation constant may be seen 
to be the total energy TL = 8 by comparing Eq. (4.44) with Eq. (4.39). 
The total energy is the separation constant for the eccentricity while the 
eccentricity is the separation constant for the total energy in the Hamilton- 
Jacobi equation (4.39). 

The Hamilton- Jacobi equation therefore splits into the additional pair 



£(£ ~ 2 pl/p~M 3 2 / 2 pe) /k+l= e 3 , 

77 {£ - 2 p 2 /p - M 3 2 / 2 pi) 2 ) /k+ 1 = -e 3 . 



(4.45) 



The polar component of the eccentricity e 3 is given by the same function 
of the parabolic coordinates £ and 77 . The 77 = const family is the reflection 
of the £ = const family. 

This symmetry in which £ and 77 interchange roles when e 3 changes 
sign is a reflection symmetry. The eccentricity vector, unlike the angular 
momentum, changes sign when the coordinates are reflected: r — > — r. The 
radial coordinate r = y/xixi is unchanged under reflection; however, the 
polar-axis coordinate X 3 is reflected. Since the parabolic coordinates are 
given by £ = r + X 3 and 77 = r — x 3 , coordinate reflection interchanges £ 
and 77 as it reflects e 3 . 
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The full description of the motion is contained in the action 

£, V, t ) = -£t + S^if) + S^(0 + ^(??). 

The partial actions follow as quadratures = f P<j> d<j>, Sg(£) = f d£, 

and S v (rj) = fp v dr] of the separated Hamilton- Jacobi equations. The 
action S^(<j>) = is identical to that in spherical-polar coordinates. The 
momenta p j and p v may be isolated from the separated Hamilton- Jacobi 
equations, Eqs. (4.45), to yield the action integrals 




The two action integrals S v (r]) are identical functions save for the 

sign of e 3 . With S^{f) and S rl (rf) represented by quadratures according to 
Eq. (4.46), = M 3 <p, and TL = £, the action is completely determined 

in terms of the spinor Casimir set I = (Ti, e 3 , M 3 ). 

The action integrals (4.46) for S^(f) and S n {rf) should be compared with 
that for S r (r) given by Eq. (4.36). Whereas the action S r (r) is not deter- 
mined because the central force potential is the general potential V(r), the 
actions .S/(£) and S n (rf) are fully determined because they exist only for 
the solitary central force potential V = —k/r = — 2/(£ + rf). Separation in 
parabolic coordinates is fully degenerate whereas separation in spherical- 
polar coordinates is only partly degenerate. When S^(f) and S v (rf) given 
by Eq. (4.46) are compared with S r (r) given by Eq. (4.37) for the particular 
potential V = —k/r, all integrals are found to have an identical structure: 
the arguments of the square roots are quadratic functions of r, £, or p. The 
spinor actions S^(f) and S rl (rf) also differ from the vector action S r (r) by 
the appearance of the factor 1/2. 

Separation of the Hamilton- Jacobi equation yields all the invariants of 
motion, particularly those invariants which lie beyond the Galilean in- 
variants. The forte of the method of separation of the Hamilton- Jacobi 
equation is the direct link it forges between a mechanical symmetry and a 
corresponding coordinate geometry. For two-body motion subject to grav- 
itational and electrical forces these invariants are the energy, the angular 
momentum, and the eccentricity. The corresponding geometrical figures of 
these symmetries are spheres, cones, planes, and paraboloids of revolution. 

Planar Symmetries — A Digression 

One can scarcely avoid wondering if the most basic coordinates of all — 
rectangular coordinates — possess any symmetries of motion for two bodies. 
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They do; but not for inverse-square forces. Rectangular coordinates, formed 
by the intersections of mutually perpendicular planes X\ = const, X 2 = 
const, X 3 = const, are the symmetry coordinates for the potential V = 
nr 2 / 2 of Hooke motion. This potential corresponds to the attractive linear 
force f = —nr and the Hooke Hamiltonian is 

TL = p 2 /2fi + nr 2 / 2 . 

The Hooke Hamiltonian in the six-dimensional phase space £ = (r, p ) of the 
relative motion can be seen to be the equation of a five-sphere embedded in 
this six-dimensional space. It shows that the phase space of Hooke motion 
is Euclidean. 

The linear central force f = —nr and the inverse-square central force 
/ = — k/r 2 are unique in mechanics. Whereas the inverse-square force is 
the only inverse power-law force for which an additional isolating invariant 
exists that endows the motion with a closed orbit, the linear force is the only 
direct power-law force for which the motion likewise possesses an additional 
isolating invariant which closes the orbit (Note 2). For the full range of 
power-law central forces / = — yr^, where /? is any positive or negative 
number, isolating invariants corresponding to closed orbits of the motion 
may be found for only the two values (3 = 1 and (3 = —2. The intimate 
connection between the bound motion of two-bodies subject to the inverse- 
square force (Kepler motion) and to the linear force (Hooke motion) is 
described in Note 6. 

The additional isolating invariant for the linear force f = —nr corre- 
sponding to the potential V = nr 2 /2 turns out to be the matrix AC 

Aj = PiPj/2p + nx.iXj/2. 

It is a straightforward matter to show that the Poisson bracket [A TL] van- 
ishes and A is an invariant. Since A is a symmetric three-by-tlrree matrix, 
it has only six independent components. But A, like the eccentricity e, 
contributes only one new scalar invariant to the motion because it is not 
independent of the other invariants TL and M. For example, the sum of the 
diagonal components of A is the total energy of the motion: 

An = p 2 / 2p + kr 2 / 2 = TL. 

As a final tribute to the intimate connection between the inverse-square 
and linear central force laws, it turns out that bound Kepler and Hooke 
motions may be transformed into one another. Hence, bound Kepler mo- 
tion also has a phase space that is Euclidean. This is a remarkable corre- 
spondence that originates in the four-dimensional rotational symmetry of 



* See Jean Sivardiere, “Laplace vectors for the harmonic oscillator,” Am. J. Phys. 57 
(6), (1989). 
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Kepler motion. Since it must satisfy the equation of the three-sphere and 
this equation is the condition of a Euclidean space, Kepler motion may be 
transformed into coordinates in which the phase space is Euclidean. 

The orbits of bound motions for both the Hooke force and inverse-square 
force are ellipses; but the center of mass for the Hooke force lies at the center 
of the ellipse while for inverse-square forces the center of mass lies at one 
of the foci. Readers may consult Note 6 to see how the three-sphere upon 
which Kepler motion exists in phase space projects onto a pair of two- 
spheres and in so doing find that Kepler motion is transformed into Hooke 
motion. 



The Orbits 

The orbits for inverse-square forces may now be drawn out; they di- 
rectly follow from the action. Since both a vector (spherical-polar) and 
spinor (parabolic) representation of the action exist, either may be used 
to produce the orbits. Let us use the spherical-polar representation. [In- 
terested readers may want to develop the orbits using Eqs. (4.46) for the 
parabolic actions.] 

The directions natural to two-body motion are the three orthogonal di- 
rections formed by the vectors M, e, and M x e of Fig. 4-11. These three 
vectors form an orthogonal set of basis vectors for the motion. The vector 
Casimir set (Tt, M, M 3 ) singles out the X 3 direction as the direction of the 
component M3. It is therefore appropriate to align the X3 axis with M as 
shown in Fig. 4-11 for the vector case. As a consequence, e 3 = 0. (For 
the spinor representation (Tt, 63 , M3), it is appropriate to align the X3 axis 
with e in which case M 3 = 0.) The angle 9 then takes the permanent 
value 9 = 7t/2, the momenta p 3 and pg vanish, and the angular momentum 
possesses only a polar component for which M = M3. Let the x± axis be 
aligned with e. The invariant e is a vector drawn from the origin of a 
conic section at the focus to its center. Since X 3 is aligned with M, the 
motion lies completely in the plane 9 = tt/2. The eccentricity vector e lies 
in this plane with e 3 = 0. Choose the origin for the angle </> by aligning the 
position vector r with the eccentricity vector e at 4 > = 0. 

In this potential the radial motion is an equivalent one-dimensional mo- 
tion in the potential W(r) as shown in Fig. 4-12. The potential W(r) has 
a minimum energy of magnitude 

£ 0 = -k 2 p/2M 2 . (4.47) 

This is the lowest value or ground state of the energy available to the relative 
motion. The energy is bounded in the range £q ^ £ < 00 . 

A point at which the radial momentum p r vanishes is a libration point (a 
point at which a coordinate reaches a maximum or minimum in the periodic 
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Figure 4~H- Coordinates for Two-Body Motion. The three natural orthogonal coordi- 
nate directions are given by the vectors sM, e, and M x e. For the vector representation 
in spherical-polar coordinates, the X 3 axis is aligned with the angular momentum vector; 
hence p% = 0 and e% = 0. (For the spinor representation in parabolic coordinates it is 
appropriate to align the X 3 axis with the eccentricity vector in which case M3 = 0.) The 
motion lies completely in the plane 6 = 7r/2. The eccentricity vector lies in this plane 
and is aligned with the x\ axis. The #2 axis is aligned with M x e. 

motion of the trajectory). For a given energy TL = £ , this point is defined 
by the condition p r = 0 in Eq. (4.5) which requires £ = W(r) and hence, 
according to Eq. (4.6), 



£ = W(r) = M 2 /2[ir 2 — k/r. 

Since this condition is quadratic in r, there are two libration points corre- 
sponding to its two roots. The roots are neatly summarized by the quan- 
tities 

?’o = M 2 /kp,, e 2 = 1 - £/£ 0 . (4.48) 

The quantity r o is the value of r at the minimum energy point and e is 
the magnitude of the eccentricity vector e which may also be expressed in 
terms of the Kepler constant as 

e 2 = 1 - M 2 /h 2 . 

The two roots representing the libration points are then 

r m in = r 0 /(l + e), r max = r 0 /( 1 - e). (4.49) 

The libration points are the intersections of the lines H = £ with the 
potential curve W(r) in Fig. 4-12. The distance of closest approach to the 
origin, or pericentron, is r m - ln and this quantity exists for all values of the 
energy above the ground state energy £q. On the other hand, the distance 
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Energy 




Figure 4-12. The Effective Potential Energy Function W(r) of Tuio-Body, Inverse- 
Square Force Motion. Bound orbits exist for energy £ < 0. The ground state energy is 
£ 0 = -pk 2 /2M 2 . 

of furthest excursion, the apocentron , only exists for energies £ < 0 which 
implies 0 < e < 1. For £ = £q, there results e = 0, r m ; n = r max = Tq, and 
the orbit is a circle of radius r o- The motion is confined to an annulus of 
outer radius r max and inner radius r m j n as shown in Fig. 4-1. 

How much of the total energy of two bodies is kinetic and how much is 
potential? From Eqs. (4.47) and (4.48) one sees that the ground state total 
energy is half the potential energy: 

So = V/2 = -k/2r 0 . 

Since £q — T + V, the kinetic energy in the ground state must be half 
the magnitude of the potential energy and of opposite sign. The ground 
state energies therefore bear the relationship T — —£q and V = 2£q. For 
inverse-square forces, the potential energy has twice the magnitude of the 
kinetic energy. It is an interesting fact that these relationships are true for 
the average values of T and V over any orbit, not just the ground state 
orbit: 



£ = (T) + (V), 

(T) = -£, (V) = 2£. 

Only in the ground state are T and V as well as £ constant over the orbit. 

It can be seen that the energy determines the kind of trajectory per- 
mitted. Three classes exist depending upon whether £ is negative, zero, or 
positive as shown in Fig. 4-13. The class corresponding to £ < 0 consists of 
bound orbits for which the bodies are forever within finite range of one an- 
other; they have insufficient kinetic energy to reach infinity. The class with 
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£ > 0 consists of those trajectories that are free. In this class the bodies 
have an excess of kinetic energy at infinity. They approach from infinity, 
encounter one another, and then escape, never to encounter one another 
again. Bodies in motion at infinity are necessarily on free trajectories. 

The trajectory £ = 0 separates the free trajectories from the bound 
orbits. Bound orbits with £ — > 0 are very large ellipses with apocentrons 
which approach infinity but nonetheless remain periodic. At the limit £ = 0 
the bodies fail to return periodically. Bodies on parabolic orbits leave and 
return from infinity with vanishing kinetic energy: they are in a state of 
rest at infinity. 




M x e 




(a) hyperbolic trajectories (e > 1) 



a = ro/(l — e 2 ) 



(b) parabolic trajectories (e = 1) 
M X e 



b = r o/ V 1 - e 2 




(c) elliptic orbits (e < 1) 



Figure j- IS. The Trajectories of Two-Body, Inverse- Square Force Motion. Free trajec- 
tories corresponding to e > 1 (£ > 0) are hyperbolas. They are separated from bound 
orbits by the parabolic trajectory for which e = 1 (£ = 0). Bound orbits are ellipses 
characterized by e < 1 (£ < 0). 



The known action of the Kepler problem is contained in Eqs. (4.34)- 
(4.37) for the vector representation and in Eqs. (4.46) for the spinor repre- 
sentation. The trajectories in configuration space are obtained as a canon- 
ical transformation from the invariant set of the elementary flow for which 
the action is the generating function. For the vector representation, the 
total action is S(r, <j>, 9, 1, t ) = — £t + S r (r ) + S^tj)) + Sg(9) and the invari- 
ant set is I = (£,M, M 3 ). The transformation to the primitive coordinates 
(r, 9, (j>) follows the canonical transformation of Eqs. (3.40). 
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With the coordinates oriented so that M3 = M, the 9 equation simply 
yields pg = 0 and 9 = const = tt/ 2. The trajectory in space <j>(r) is 
contained in 

dS/dM = <j) + dS r /dM = const, 
while that in time r(t) is given by 

dS/d£ = —t + dS r /d£ = const. 



With the action <S^(</>) = M(f> and S r (r) taken from Eq. (4.37), the trajec- 
tory equations are 



M dr 



r 2 \j2[i {£ + k/r — M 2 /2pr 2 ) 



(4.50) 



and 



t = 



p dr 

\j2p {£ + k/r — M 2 /2gr 2 ) 



(4.51) 



Equation (4.50) yields the orbit in space showing how <j> and r are related 
while Eq. (4.51) gives the orbit in time relating r to t. Each of these orbit 
integrals has an interesting story to tell not only about the motion of two 
bodies but about the mathematical nature of orbits generally. 

Take first the orbit in space, the integral (4.50). The invariants £ and 
M may be expressed in terms of e and ro according to Eq. (4.48) and the 
equation may be integrated directly with the result 



r(<t>) = ro/(l + ecos <j>). (4-52) 

Equation (4.52) describes the family of conic sections with the closest focus 
at the origin, eccentricity e, and latus rectum ?’o- The latus rectum of the 
complete set of orbits turns out to be the same as the radius of the ground 
state orbit: ?’ 0 = M 2 /kp. It does not depend on the energy as does the 
eccentricity. (The properties of conic sections are reviewed in Note 4.) The 
three ranges for the energy correspond to three ranges for the eccentricity, 



elliptic orbits : 
parabolic orbits : 
hyperbolic orbits : 



£ < 0, 0 < e < 1, 

£ = 0, e = 1, 

£ > 0, e > 1. 



For e < 1, the orbit is an ellipse with one focus at the origin. (The “ellipse” 
for e = 0 is a circle with center at the origin.) For e = 1, the trajectory is a 
parabola. For e > 1, the trajectory is a hyperbola with focus at the origin. 
All three trajectories are illustrated in Fig. 4-13. 

There is a royal road to the trajectory in space bypassing Eq. (4.50). 
It is provided by the hidden invariant of Kepler motion, the eccentricity 
vector. Project the eccentricity vector onto the position vector and obtain 



r ■ e = re cos <j) = r ■ (p x M ) /kp — r. 



(4.53) 
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Since r • (p x M) = (r x p) ■ M = M 2 , the orbit in space (4.52) also follows 
from Eq. (4.53) and the fact that M 2 fk[i = r 0 . Equation (4.52) which 
follows from the invariance of e describes a closed trajectory. It therefore 
manifests the direct connection between this additional invariant and the 
closed-orbit degeneracy. 

Let us examine the bound orbits which are ellipses corresponding to 
£ < 0 and e < 1 in Fig. 4-13 (a). The major axis of the ellipse is aligned 
with the eccentricity vector which is drawn from the origin of the ellipse 
at the focus closest to pericentron to its center. As shown in Note 4, the 
semi-major axis a and semi-minor axis b of the ellipse are given by 

a = r 0 /(l-e 2 ), b = r 0 /y/l- e 2 , (4.54) 

which, for the conditions (4.48) of Kepler motion, are 
a = k/2\£\, b = M/y/2n\£\. 

The semi-major axis a depends only upon the total energy of the bodies. 
On the other hand, the semi-minor axis depends upon both the angular 
momentum and the energy. The tightest orbit is the circle corresponding 
to the ground state for which a = ro = k/ 2 \£q\. The orbits widen for 
\£ | — > 0 approaching the limit of the parabolic orbit at £ = 0. 

Children learn to draw the ellipse by using two invariants: the distance 
between two pegs driven into a surface and the length of a loop of string 
slipped over the pegs. A pencil traces out the orbit as it draws the string 
taught. The length of celestial string required for two heavenly bodies 
is related to the mechanical invariants £ and M as (1 + e)k/\£\ and the 
heavenly pegs must be separated by the distance ek/\£\. 

The parabolic trajectory corresponding to e = 1 approaches from infin- 
ity, crosses the X 2 axis at a distance ro from the origin, and crosses the 
eccentricity vector at exactly half this distance from the origin as shown in 
Fig. s (b). 

The free trajectories are hyperbolic and correspond to e > 1. They pass 
from infinity and cross the eccentricity vector a distance ?’ m i n = a/(e + 1) 
from the origin where the parameter a takes the value a = ro/ (e 2 — 1) 
for the hyperbolic case. Like all the other trajectories, they also cross the 
X 2 axis in Fig. 4-13 (c) at a distance ro from the origin. Bodies on free 
trajectories are in motion at infinity {£ >0). For very energetic bodies for 
which £ |£ 0 |, the trajectory is a straight line which passes a distance 

ro/e from the origin. 

The elliptical orbits of the planets are shown in Fig. 4-14. The orbits of 
the giant planets are nearly circular. Neptune has the most circular orbit of 
all the planets with an eccentricity of 0.009. Pluto (the most distant planet 
from the sun) and Mercury (the closest) possess the most eccentric orbits 
(e = 0.25 and e = 0.21, respectively). The earth’s orbit is one of small 




122 



The Shaggy Steed of Physics 



eccentricity e = 0.017. The asteroid Icarus, whose orbit lies within the 
inner planets, possesses an eccentricity e = 0.87. Ceres and its associated 
asteroids which lie between Mars and Jupiter have eccentricity e = 0.076. 

How long does it take the bodies to traverse their trajectories? The 
typical time over which the bodies significantly interact may be directly 
found from the mechanical quantities which describe the motion: £, M, k, 
and /i. There are two time scales which one can form from these quantities, 
one based upon £ and one based upon M. One can see that there are 
two different time scales by imagining first that the angular momentum 
M vanishes. The time scale is then established by the rate at which the 
inverse-square force draws the bodies together along a straight line and is 
determined by the total energy £. On the other hand, imagine the energy 
vanishes. In this case the angular momentum dominates the motion and it 
determines the time scale of the interaction. 

The set of quantities containing £ (but not M) may be combined to form 
a quantity solely possessing the dimensions of time: the Kepler frequency, 

to = i/8 (— £) 3 //c 2 /r = \Jk/ n/a 3 / 2 . (4.55) 

The Kepler frequency has the dimensions of a reciprocal time and it will be 
shown that its reciprocal indeed gives the time it takes the bodies to orbit 
one another for the elliptic case or the time over which the bodies signif- 
icantly deflect one another for the hyperbolic case. The Kepler frequency 
is appropriate for both elliptic and hyperbolic orbits for which 5^0; but 
it vanishes for the parabolic case for which £ — 0. It is here that the time 
scale formed from M, k, and /r rather than £ becomes appropriate. A 
parabolic frequency appropriate for this case may be formed as 

coo = fc 2 /i/M 3 = yjk/n/rl /2 . 

While the Kepler frequency to sets the time scales for elliptic and hyperbolic 
orbits, the time parameter which describes parabolic orbits is loq- The two 
time scales are related by the eccentricity as 

lo/lo 0 = (1 - e 2 ) 3/2 . 



To find the exact expression for the time along the trajectory one must 
integrate the time equation (4.51). This equation may be made more trans- 
parent for the elliptic and hyperbolic cases by introducing the Kepler fre- 
quency as well as the orbit parameters a and e in favor of £ and M: 



tot = 



r dr 

a\Ja 2 e 2 — (r — a ) 2 



(4.56) 



The parabolic case is special. The integral (4.51) for parabolic orbits for 
which £ = 0 simplifies to 



u>ot — 



/ 



r dr 



r W( 2r / r o - 1) 



(4.57) 
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Figure f-lf. Orbits of the Planets. 
The planets are shown in scaled dis- 
tances from the sun. Orbit tracks for 
each planet are shown for one earth- 
year. Mercury (the nearest to the sun) 
and Pluto (the most distant) have the 
most eccentric orbits. 
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Unlike the space relationship between r and <f> which is integrable in ele- 
mentary functions — the conic sections (4.52) — the time relationship (4.56) 
is not directly integrable in elementary functions. Mathematicians of the 
late seventeenth and eighteenth century found that an integrable result in 
elementary functions could be obtained through a transformation from the 
angular variable </> to an new angular variable if such that 



r = a{ 1 — e cos if), 


e < 1, 




r = ^sec 2 (V>/2), 


e = 1, 


(4.58) 


r = a(l — ecosh if). 


e > 1. 





Astronomers call the angle if the eccentric anomaly. The relationship 
between the eccentric anomaly if and the true anomaly <p may be obtained 
from Eqs. (4.52) and (4.58) so that one finds the angles are related as 

[l+~e 

tan(</>/2) =\ tan(^/2), e < 1, 

V 1-e 

tan(</>/2) = tan(^>/2), e = 1, (4.59) 

le+1 

tan(0/2) =\ tanh(^>/2), e > 1. 

V e-1 

For the elliptic case (£ < 0) the angle <f revolves continuously with period 
27 t as if sweeps through its range of values from — oo to +oo; but for the 
hyperbolic case (£ >0), the sweep of if over its range sweeps the angle 
(f only over the half-turn from — 7r/2 to +7 t/ 2 as required by the third of 
Eqs. (4.59) in which the tangent is made equal to the hyperbolic tangent. 
Thus, the continuous parameter if naturally generates both the infinitely 
periodic, repeating, bound orbits of the elliptic case and the single-pass, free 
orbit of the hyperbolic case. The parabolic case (£ = 0) again demonstrates 
its special character: it is integrable in elementary functions as signaled by 
the equality between </> and if in the second of Eqs. (4.59). 

Introduction of the eccentric anomaly through Eqs. (4.58) into the time 
equations (4.56) and (4.57) releases simple integrals for elliptic, parabolic, 
and hyperbolic trajectories: 



cot — if — esinif, 


e < 1, 




coot = tan(0/2)[l + tan 2 (</>/2)/3], 


e = 1, 


(4.60) 


cot = e sinh if — if, 


e > 1, 





where the constants of integration have been chosen so that if = (f = 0 at 
t = 0. 

The orbit variables r and t are parametrically represented in terms of if 
through Eqs. (4.58) and (4.60). As if traces out its excursion from — oo to 
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+oo, r traces excursions out and back over its range between r m i n and ?’ max , 
</> completes periodic revolutions of 27r for the elliptic case or a half-turn 
from — 7 t/ 2 to +7 t/ 2 for the parabolic and hyperbolic cases, and t increases 
with ip monotonically save for a periodic fluctuation proportional to the 
eccentricity e. 

While the elliptic and hyperbolic trajectories must be represented in 
terms of the eccentric anomaly ip, the parabolic orbit is directly expressible 
in terms of the true anomaly <j> by the second of Eqs. (4.60).* 

For elliptic orbits, ip generates periodic orbits in </> according to the first 
of Eqs. (4.59). To find the time T for one complete orbit on the ellipse, set 
t = T and ip = 2n in the first of Eqs. (4.60) and find 

T = — = nk\J [xf2{— £) 3 . (4-61) 

LU 

The period of the orbit is revealed to depend only upon the square root of 
the reduced mass of the bodies and the inverse | power of their total energy. 
Massive bodies have long periods. Energetic bodies have short periods. It 
is also possible to express the period in terms of the linear dimension of the 
orbit by eliminating £ in favor of a from the first of Eqs. (4.54): 



T= 2n^pjka 3/2 . ( 4 -62) 

The time T to complete a revolution around an elliptical orbit may also 
be conveniently determined from Kepler’s second law by separating the 
variables: 

dA = (M/2/x) dt, 

from which 

T = j (2 n/M) dA = 2nA ellipse /M. 

" -^ellipse 

Since the area of an ellipse is nab, the period is T = 2n\J p/ka 3 ^ 2 when 
the relations (4.54) are used to eliminate M and b. Kepler’s third law is 
contained in Eq. (4.62): the periods of the planets are proportional to the 
linear dimension of the orbit to the | power. 

The time dependence of the hyperbolic orbits is the same as that for the 
elliptic orbits with hyperbolic functions replacing circular functions. (The 
times for the hyperbolic trajectories can be obtained from those of the 
elliptic trajectories by making the angle and Kepler frequency imaginary: 
ip — > iip, u> — > iu>. This is because of the square-root dependence of w on 
£ which switches sign for the two cases. Only a finite amount of time is 



* This elementary integral is obtained by substituting the parabolic expression for r 
from Eq. (4.58) into the time integral (4.57) with the result ujot = J sec 4 (</>/2) d(f). The 
integral is then developed by breaking the integrand into two terms through the identity 
sec 4 (0/2) = sec 2 (</>/ 2) [l + tan 2 (^>/2)] which leads to the second of Eqs. (4.60). 
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required for the body to repeat its trajectory on an elliptic orbit since the 
path length of the orbit is finite. Since the path lengths of the parabolic and 
hyperbolic trajectories are infinite and the body moves at finite velocity, 
the time required to complete a transit over the entire trajectory is likewise 
infinite. 



The Kepler Equation 



Equations of disarming simplicity often provoke epochal developments in 
mathematics. One such equation is the time equation of two-body orbits, 



cot = if — esinip. 



It is known as Kepler’s equation. This modest equation spurred the work 
of many of mathematics’ most fruitful contributors including Newton, La- 
grange, Euler, and Augustin-Louis Cauchy. Equations (4.60), of which 
this is the elliptic case, are more useful to astronomers when the eccen- 
tric anomaly is expressed as a function of time. Inversion of the Kepler 
equation to ip = ip(t) served as a major challenge to mathematicians of the 
seventeenth and eighteenth century, giving birth to major developments in 
analysis as well as the theory of functions of complex variables (Note 5). 

Before the seventeenth century algebraic expressions dominated analysis. 
[An algebraic expression is a polynomial of the form ip(t) = ao + aq t + 
02 1 2 + • • • + a n t n with a finite number of terms.] It was natural to wonder 
if the inversion of Kepler’s equation could be developed as an algebraic 
expression. In a masterful proof in the Principia, Newton showed this 
is impossible; moreover, his proof has even wider implications. It shows 
that the area of any smooth oval cannot be expressed algebraically. This 
connection which relates algebraic area integrals generally and the time 
integral for Kepler motion springs from Kepler’s second law: the times for 
orbits are area integrals over these orbits.* 

Since the time is an area integral of the ellipse — a smooth oval — it in- 
creases periodically by the same amount every time the true anomaly <p 
completes a cycle of 27 t. As the eccentric anomaly ip sweeps over its 
range, infinitely many cycles of <p result. The resulting equation ip = ip(t) 
must therefore have infinitely many roots. (They are, in fact, t = riT : 
n = 0, ±1, ±2, . . . .) Since an algebraic expression has only a finite number 
of roots equal in number to the order n of the polynomial, it cannot be the 
time integral of Kepler motion. 



* A discussion of Newton’s proof may be found in V. I. Arnol’d, Huygens and Barrow, 
Newton and Hooke, Basel: Birkhauser, 1990. 
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Figure 4~15. An Algebraically Integrable Oval. The oval = (1 — r 2 )\/] + r 2 where 
r = tan <p has an algebraic area integral j (r 2 /2) dtp = t 3 (t 2 /5 — 1/3). 

This result, framed in the language of time and the Kepler ellipse, is 
more general. Newton showed the area integral of any smooth oval cannot 
be algebraic! This does not mean the area integral of any closed curve 
cannot be algebraic. Orbits which are not smooth can possess algebraic 
area integrals. An example is the oval with kink of Fig. 4-15, 

r ( 4 >) = (1 - t 2 )^/ 1 + t 2 , 
where r = tan <j>. It has the area integral 

J (r 2 / 2) dcj) = t 3 (r 2 / 5 — 1/3) . 

Functions which are not algebraic are said to be transcendental and have 
expression as infinite series. The exponential and trigonometric functions 
are the most familiar transcendental functions. They are the real and imag- 
inary parts of the exponential function of z = x + iy whose infinite series 
expansion is 

OO 

e z = e x (cos y + isiny) = ^ z m /m\ = 1 + z + z 1 /2\ + z 3 / 3! + • • • . 

ra = 0 

The infinite series representing the transcendental solution of Kepler’s equa- 
tion (which is not so familiar) is 

OO ^ 

= wt + 2 — J m (me) sin mwt, (4.63) 

' m 

m=l 

where J m is the Bessel function. The detailed development of this solution 
may be found in Note 5. 
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The Bessel function J m (rne) appearing in Eq. (4.63) vanishes for e = 0 
and oscillates with a diminishing amplitude for large values of its argument 
me. This is just the required behavior to allow the angle for circular orbits 
(e = 0) to be directly proportional to time: ip — wf- But for e > 0, the 
Bessel function has the property of speeding up and slowing down the sweep 
of the angle giving the orbit angle a nonlinear dependence on time. As a 
result, the angle sweeps rapidly around the orbit near pericentron and more 
slowly around the region near the apocentron. The devices of deferents and 
epicycles which Ptolemaic astronomers introduced to force this effect in the 
motion of the planets is here naturally portrayed in the motion of Eq. (4.63) 
through the oscillatory property of the Bessel function. 

Collisions 

It is impossible for two point particles in one another’s attractive field to 
collide if the angular momentum remains finite. The separation between the 
bodies is given by the relative position vector r. Two bodies may approach 
no closer than the distance r m ; n which is proportional to the square of the 
angular momentum in Eqs. (4.48) and (4.49). This distance is measured 
from the center of mass of the bodies. A small caveat is therefore worth 
noting. Although two point particles with finite angular momentum may 
not collide, finite bodies may collide if their collisional cross-section radius 
(which is the sum of their body radii, r i + r 2 ) is greater than r m i n . 

When M — > 0, a two-body collision becomes possible. A collisional 
trajectory has eccentricity e = 1 even though the energy need not vanish. 
This is easily seen from Eq. (4.48) since £q — > oo when M — > 0. In 
the collisional limit the Kepler ellipse becomes more and more elongated 
and degenerates to a straight line oscillation along the eccentricity vector 
of Fig. 4-16 with r min = 0 and r max = 2 a from Eqs. (4.48) and (4.49). 
The origin representing the center of mass and the pericentron merge, the 
distant focus and the apocentron merge, and the moving point representing 
the bodies oscillates between these points over the distance 2a. Infinitely 
many collisions may occur with period T, a collision occurring each time 
the moving point returns to the origin. 

In the parabolic and hyperbolic cases (£ ^ 0) the collisional trajectory 
is also a straight line passing through the origin. In contrast to the elliptic 
case, the straight-line parabolic and hyperbolic trajectories pass to infinity. 
Only one collision occurs in the parabolic and hyperbolic cases since there 
is only one nonperiodic contact with the origin. 

Collisions introduce a delicate feature into the classical mechanics of 
point particles: the existence of singularities. A singularity is a point where 
one or more quantities become infinite. The inverse-square force is infinite 
at the point of collision. Why then does it not lead to an infinite speed of 
the particles? The answer is that the impulse integral f (k/r 2 ) dt containing 
the point r = 0 is finite. 
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Figure j- 16. Collisional Trajectories. The ellipse (a) of an incipient collisional orbit 
degenerates to straight-line oscillations along the eccentricity vector when e — » 1 (b). 
The pericentron and origin merge at r m ; n = 0 and the distant focus and apocentron 
merge at r max = 2a. 

For two-body motion with gravitational and electrical forces, infinite 
speeds of particles can never be established in finite times. Remarkably, in 
the case of several bodies attracted by inverse-square forces it is possible 
to arrange them in a way so as to accelerate one of the bodies to infinite 
speed in finite time — the many-bodies act like “sling-shots” on one of the 
bodies. One can also devise peculiar force laws that accelerate two bodies 
to infinite speeds in finite time.* 



Heavenly Tori 

When the orbits are bound, Kepler motion (like all bound, integrable 
motion) may be reduced to a form of ultimate simplicity: the motion lies 
upon an invariant torus in phase space most naturally described in angle- 
action coordinates. Since the relative motion is three-dimensional, this 
torus is a three-torus built up as a Cartesian product of three circles, each 
of which is described by one of the three angle coordinates. The three- 
torus is a toroidal body foliated with nested two-tori. Since the motion 
is fully degenerate, the flow winds once around these two-tori on closed 
streamlines. 

Let us exhibit the invariant tori of Kepler motion. The invariant set I 
of the motion consists of the single-valued action coordinates J and the 
trajectory flow is a composition of three periodic oscillations, each charac- 
terized by an invariant amplitude and an angle which rotates at an invariant 



* See Florin N. Diacu, “Painleve’s Conjecture,” The Mathematical Intelligencer, 15 , 
No. 2 (1993). 




130 



The Shaggy Steed of Physics 



frequency. These angle-action coordinates of integrable motion arise in a 
canonical transformation from either the vector or spinor representation of 
the separated coordinates and momenta. 

Because of the periodic nature of bound motion, the angle coordinates 
are of two kinds: rotations and librations. A rotation coordinate repeatedly 
passes through periods of 2n. A libration coordinate oscillates back and 
forth between two extremes. In Kepler motion the angular coordinate <f> is 
a rotation coordinate successively passing through periods of 2n and the 
radial coordinate r is a libration coordinate oscillating between r m i n and 

T max* 

The action coordinates of the vector representation are obtained from 
Eqs. (4.34) through (4.37): 



J r = (27 r) 1 i/2/i \H + k/r — M 2 /2/xr 2 ] dr, 

J^ = {2n)~ 1 j M 3 d(t>, 

Jg = (27 T )" 1 j \J M 2 — M 3 2 / sin 2 6 dJ9. 



The complicated appearance of these integrals is misleading; for they are 
quite simple. This is because they represent the contour integral of a com- 
plete orbit in phase space. (They would not be so simple if the integral 
were not over the complete orbit.) Since they are invariants, one knows 
the actions must be expressible in terms of the Casimir set (H, M, M 3 ), 
or since the energy may equally well be expressed in terms of the Kepler 
constant, in the form in which all quantities have the dimensions of action, 

(h,M,M 3 ). 

The polar angle action from Eq. (4.64) is immediately integrable as 

J(j) = (27t) _1 / M 3 # = M 3 . (4.65) 

Jo 

The two invariants M 3 and M define a cone on whose surface the angu- 
lar momentum vector must lie. The half-angle \ of this cone is given by 
sin% = ( M 3 /M ). As the motion of the orbit traces out a complete period, 
the latitudinal angle 6 librates between — \ and Th e latitudinal action 
variable from Eq. (4.64) thus covers the range j> d6 = 2 f* dO: 

Jg = ( 7 r)~ 1 M f \J^~ sin 2 %/ sin 2 9 d6 
J-x 

and has the integral 

Jg = M( 1 — siny) = M — M 3 . 



(4.66) 
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The radial coordinate r librates between r m and r max - The range for 
the integral J r is therefore j> dr = 2 J r rmax dr. This integral is most naturally 
evaluated with the theory of residues, a method of evaluating closed contour 
integrals in the complex plane.* It turns out to be simply 

J r = h- M. (4.67) 

It follows from (4.66) and (4.67) that the sum of the actions is the Kepler 
constant, 

h — J r - (- J ^ T J g , 

The action coordinates are invariants. The more familiar forms of the 
invariants for separated Kepler motion in splrerical-polar coordinates — 
the invariant set ( TL , M, M 3 ) — may be expressed in terms of them through 
Eqs. (4.65)-(4.67) and the condition H = —k 2 n/2h 2 : 



M3 = J,/,, 

M = Jff, + Jg , 

= k 2 n/ 2 

( Jr + J<f> + Jq ) 2 



(4.68) 



The phase velocities in angle-action coordinates are u>i = dH/dJi. It is 
apparent from the last of Eqs. (4.68) that all three phase velocities u r , w#, 
Log will be identical for the Kepler Hamiltonian since the action coordinates 
appear in the sum ( J r + + Jg). This single, common phase velocity is 

found from Eq. (4.68) to be the Kepler frequency (4.55): 



k 2 ^i 

(Jr + J^ + Jg) 3 ' 



(4.69) 



The equality of all three frequencies manifests the complete degeneracy of 
Kepler motion. Equality of and tog corresponding to the appearance 
of and Jg in the sum ( + Jg) is the signature of the central force 
degeneracy. Equality of tu r with and u>g corresponding to the appearance 
of all three action coordinates in the sum ( J r + J ( / > + Jg) is the signature of 
the additional inverse-square force degeneracy. 

The simplicity achievable with the angle-action representation is further 
evidenced by the Hamiltonian. From Eq. (4.69), it may be expressed in 
action coordinates in the simple form 



TL = -JiOJi / 2 

and may be compared with the far more involved form of Eq. (4.29). The 
final invariant of Kepler motion, the eccentricity e, is also expressible in 



* Arnold Sommerfeld, Mechanics, New York: Academic Press, 1956; H. Goldstein, 
Classical Mechanics (2nd Ed.), Reading Mass: Addison Wesley, 472-478, 1980. 
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action invariants, 
action variables is 



Since e 2 = 1 — M 2 /h 2 , the eccentricity magnitude in 



= 1 



J(f> + Jo \ ~ 
Jr T J<fi T Jo J 



(4.70) 



Reduction to angle-action coordinates is also possible for the spinor rep- 
resentation of Kepler motion. In this case the action coordinates are Je, 
J v , and Jcf, and they are expressible in the spinor Casimir set (TL, E 3l M 3 ). 
The action variable J $ is identical to that for splrerical-polar coordinates. 
The remaining action coordinates may be constructed from Eqs. (4.46) as 



Je = (2<r^ f [n+( 1 - e 3 )k/Z - M 3 2 / 2^ 2 ] dt, 

Jr, = (27t) _1 ^ j) \J 211 [H + (1 + e 3 )k/r] - M 3 2 / 2 prj 2 ] dp, 



(4.71) 



(note the factor 1/2 in front of these integrals, a sign that these are spinor 
representations). These integrals may be evaluated by again using the 
theory of residues with the result 



J<j> = M 3 , 

= h/2 - ( M 3 + E 3 )/2, (4.72) 

Jr, = h /2 - (M 3 - E 3 )/ 2 . 

Since the parabolic coordinate form of Kepler motion is a spinor repre- 
sentation, one expects the parabolic actions to be spinors. This is indeed 
the case; the parabolic actions J £ and J v are found from Eqs. (4.22) and 
(4.72) to be none other than the spinor invariants of the vectors S and D 
that belong to the pair of two-splrere families that cover the three-sphere 
of four-dimensional rotational symmetry: 

J i = S ± , J V = D ± . 

The spinor Casimir set (h, E 3 , M 3 ) is expressible in action coordinates as 



M 3 — J(f), E 3 — Jr, — h — Jrj + J^ + J f. (4.73) 

The corresponding Hamiltonian has a form similar to that for the vector 
representation, 

= k 2 n/2 

( J(j> + J$ + Jr ,) 2 

In passing, one notes that the Poincare invariant S = § p., dq, of integrable 
motions is a sum of the action invariants, S = 2tt ^^Ji- The Poincare 
invariant of Kepler motion is independent of the representation; for either 




4- Classical Mechanics: The Heavens 



133 



the vector or spinor representation it is (aside from a factor of 27 r) the 
Kepler constant: 

S = 2nh. 



The Heavenly Spheres 

As a bound, integrable motion, Kepler motion lies upon a three-torus. 
From the viewpoint of its symmetries, however, Kepler motion lies upon a 
three-sphere which is the basis of its four-dimensional rotational symmetry 
(the Kepler three-sphere is explicitly exhibited in Note 6). One therefore 
has two different topological representations of Kepler motion: a three- 
torus filled with two-tori upon which streamlines close after one turn and 
a three-sphere filled with a pair of families of two-spheres upon which the 
trajectories also wind. 

How can the motion lie both upon a torus and a sphere? This unique oc- 
currence is due to the exceptional properties of four-dimensional space and 
the three-sphere supporting its rotational symmetry. Although the sphere 
and torus are in general topologically distinct objects, in the remarkable 
case of the three-sphere, they come into coincidence. Kepler motion simul- 
taneously lies upon a sphere and a torus! Let us see how the tlrree-torus 
and three-sphere are united in Kepler motion. 

An instructive way of seeing the topological equivalence of the three- 
sphere and the three-torus is to turn one into the other with operations 
that preserve topology. One begins with the three-sphere as represented 
by a pair of three-dimensional spherical bodies “balls” joined along a great 
two-splrere in Fig. 4-17 (b). First, cut the bodies apart along the great 
two-sphere (the common “skin”) which joins them, separating them as in 
Fig. 4-17 (c). (The bodies will subsequently be glued back together on this 
surface.) Next, open a toroidal cavity with surface C in one of the bodies, 
say Ai, as shown in Fig. 4-17 (d). Next slice through the body A\ to the 
inner edge of the toroidal cavity thereby exposing the surfaces B[, B" as 
in Fig. 4-17 (e). Now draw out these exposed surfaces so that the surface 
C of the toroidal cavity becomes the outer surface of a growing toroid in 
Fig. 4-17 (f). The formerly outer spherical surfaces A[, A'{ along with B[, 
B " now become the cross-sectional surface of the emerging toroid. (It is 
important to note that although the sliced surfaces B[, B " can be joined 
back together to complete the torus as in (g), the surfaces A \ , A" cannot 
be joined in this way because they were severed along their edges, not their 
faces. A single two-splrerical body cannot be turned into a torus. It is the 
second spherical body joined with the first that makes completion of the 
torus possible.) The surfaces A' 2 , A! 2 of the second spherical body can now 
be joined to the surfaces A \ , A'{ of the first as they were in the original 
(a)-(b). The surfaces B [ , B’{ can also be joined together as in Fig 4-17 (g) 
thereby creating the three-torus foliated with two-tori (h). 
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Figure J^-17. Topological Equivalence of the Three-Sphere and Three-Torus of Kepler 
Motion. The Kepler three-sphere (a) is topologically equivalent to two interpenetrating 
spherical bodies (b) and may be turned into a three-torus (h) by unfastening the bodies 
along their surface skin and separating them (c), opening up a toroidal cavity C in one 
of them (d), slicing into the cavity and drawing out a proto-torus (e), (f), and closing 
it by rejoining the sliced surfaces and refastening the spherical bodies by their skins (g) 
to yield the integrable three-torus upon which the motion also lies (h). 
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The complete motions of the two-body problem of celestial mechanics, 
“The Kepler Problem,” have been revealed; and we have rediscovered the 
perfection of the heavenly spheres in new raiment. The motion of two 
bodies in its most penetrating and elegant mathematical form consists of 
perfect circles which build invariant tori. Heavenly bodies lie upon heav- 
enly tori; and these heavenly tori are coincident with heavenly spheres, a 
correspondence unique to Kepler motion. 

The underlying symmetries of the solar system are indeed the symmetries 
of perfect spheres — the rotational symmetries of spheres in four dimensions. 
We do not directly see this underlying spherical symmetry in the configu- 
ration space of out experience — we see ellipses. But mathematics provides 
us the ability to see into phase space. And in the full world of motion that 
is phase space, spherical symmetry shines forth. In so doing it reveals that 
the heavenly spheres have a richer structure than the ancients imagined. 
The heavens are an outstanding foliation of spheres within spheres which, 
by the magic of four-dimensional space, are at the same time a foliation of 
tori within tori. 



Reprise 

The solution of the Kepler problem epitomizes the unity and harmony 
of mathematics and the material world. One begins with the symmetries of 
space and time and the simple law of least action. It offers up the equations 
of motion, the universal Galilean invariants, and a new invariant which 
exists only for the inverse-square force, the force of gravity and electricity. 
One asks for the bound orbits by restricting the energy to the range £ < 0. 
An ellipse results. The geometric properties of the ellipse — its axes and the 
periodic time to complete a circuit around the orbit (a, 6, T ) — though pure 
mathematical quantities are, through the laws of motion, endowed with 
mass, energy, and angular momentum — the stuff of materiality: 

a = k/2£, b = M / \f2p\£\, T = n k\J p,/ 2|£| 3 . 

The inverse-square force of gravity within the space and time of rota- 
tional and translational symmetry is one of the great unifying themes of 
the physical world and a key to understanding all heavenly motions. Yet 
this force is the unifying feature not only for the planetary motions of the 
heavens but also for the atom and the elements which are the stuff of heav- 
enly bodies. The unity is even more manifold than Newton, Galileo, and 
Copernicus might have imagined. 

One stands in amazement before the far-reaching presence of the four- 
dimensional rotational symmetry of Kepler motion in the world. The ulti- 
mate building block of this symmetry is the spinor, the most elementary ob- 
ject possessing rotational symmetry. A single spinor builds a three-vector. 
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A pair of spinors also builds a pair of three-vectors which describe a four- 
dimensional rotation. A pair of spinors turns out to be the mathematical 
signature of the electron and its antiparticle ,s the positron. The same rota- 
tional symmetry generalized to complex quantities and higher dimensions 
explains much of the structure of the most elementary particles known to- 
day. 




Chapter 5 



Quantum Mechanics: The Elements 



Consider the solar system: an ensemble of isolated point-like planets or- 
biting the sun through the vast tenuousness of interplanetary space. Con- 
sider the elements: rich textures and colors, a diversity of gaseous, liquid, 
and solid phases, abundant forms of chemical reactivity. These two as- 
pects of the natural world appear incomparable. Yet the heavens and the 
elements are possessed of a striking unity. 

The heavens and the elements are both the manifestation of particles in 
motion bound by the inverse-square force: sun and planets which constitute 
the solar system; electrons and charged nuclei which constitute the atom. 
Both are governed by the principle of least action and shaped by the same 
Hamiltonian structure. Both partake of the same symmetries and possess 
the same invariants. The prototypical planetary orbit and the prototypical 
element both blossom from four-dimensional rotations. 

How, then, explain the differences? To be sure, the heavens and the 
elements are sustained by different forces, gravity on the one hand and 
electricity on the other. Though both are inverse-square forces, the strength 
of the electrical force between two electrons is forty-two orders of magnitude 
greater than that of their gravitational force. So there is a decoupling that 
takes place. The large-scale world bound by gravity goes its way with only 
a loose connection to the microscopic world bound by electricity. Yet this 
is not the whole story. 

The decisive difference between the heavens and the elements, between 
solar system and atom, turns on the fact that quite independently of the 
kinds of forces acting, all particles are ultimately wave-like. Pressed to 
its limit, a particle is not a little marble; it is a bundle of waves with a 
definite wavelength that specifies the distance between the alternate crests 
and valleys of its wave structure. The gravitational force binding the solar 
system does not push the sun and planets to this limit; but the electrical 
force binding the electrons to the nucleus of an atom does push atomic scale 
particles to this limit. The mechanics of the bodies of the solar system is 
a particle-like classical motion. The mechanics of the microscopic particles 
which constitute the atom is a wave-like or quantum motion. 
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The movement from classical mechanics to quantum mechanics is revo- 
lutionary. Yet the central concepts of mechanics are preserved in integrity. 
The law of motion has a universal character. Least action, invariants, 
symmetries, and Hamiltonian structure pervade both the classical and the 
quantum world. Indeed, the quantum enhances and deepens their signifi- 
cance as it introduces new features unique to the quantum realm. 

The quantum scale is extraordinarily small by human standards, roughly 
a billion times smaller than the human hand. This is why the discovery of 
quantum phenomena is a late development in human history. The world 
of the quantum is a microscopic world of particles — the atom, electrons, 
protons, neutrons, and a host of others. Yet the effects of the quantum 
are not confined to the microscopic. The quantum signs its presence in 
the large-scale world in the unique structure of the elements out of which 
the chemistry of nature and the diversity of matter arise. The diversity of 
the elements rests upon the quantum interaction of charged particles. We 
begin our journey into quantum mechanics with a survey of these particles. 



Atom: Source of the Elements 

The atom consists of a nucleus bound by the electrical force to a constel- 
lation of orbiting electrons. The nucleus of the hydrogen atom consists of a 
single proton. But all other atomic nuclei consisting of more than one pro- 
ton also contain neutrons which bind the protons together with the strong 
nuclear force (the force must indeed be “strong” since it binds protons into 
a nucleus with a greater strength than their repulsive electrical force which 
would otherwise fling them apart) . The nucleus extends over an exceedingly 
small region of space whose breadth is ten thousand times smaller than the 
spatial extent of the electronic orbits. If the nucleus were represented by 
the sun, the electronic orbits would be located in the vicinity of and beyond 
Pluto, the most distant planet. 

The total number of negatively charged electrons in an atom is equal 
to the number of positively charged protons in its nucleus. The atom as a 
whole is therefore electrically neutral. The prototypical atom is the hydro- 
gen atom. It consists of a nucleus formed of one proton to which a single 
electron is bound by the inverse-square electrical force. 

Each element is characterized by the total number of electrons (or pro- 
tons) which constitute its atoms. The weight of an element (or the atomic 
weight) has a one-to-one correspondence with the number of orbiting elec- 
trons. The periods of the Periodic Table which are based upon atomic 
weight are therefore uniquely correlated with the number of electrons (or 
protons) of the atom of a given element. Periods of 2 , 8 , and 18 which 
describe the chemical and physical properties of the elements are the in- 
escapable pattern in the Periodic Table as shown in Fig 2-4. Atoms with 2 
electrons behave similarly to atoms with 8 and 18 electrons. Atoms with 3 
electrons behave similarly to atoms with 9 and 19 electrons, and so on. 
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The elliptical patterns and transit times in the motions of the heav- 
enly bodies are a manifestation of the law of motion for an inverse-square 
gravitational force. It is no small marvel that the rhythmic patterns of 
the Periodic Table are a manifestation of the same law of motion with an 
inverse-square force identical to that of gravity. 

The action principle for gravitational potential energy governs the uni- 
verse in the large. If the action principle with an electrical potential energy 
identical in form to that of gravity governs the atom, then it should lead to 
geometrical structures for the atom identical to those of the solar system. 
Two attractive charges may form bound elliptical orbits. They may also 
form unbound parabolic and hyperbolic trajectories. 

So there it is: a miniature solar system with a charged nucleus playing 
the role of the sun, orbiting electrons playing the role of the planets, and 
the electrical force playing the role of gravity. It is remarkable that this 
model proved to be correct. It possesses devastating contradictions. 

First, an atomic scale solar system has an amorphous structure. An elec- 
tron orbiting the nucleus according to the mechanics of Newton, Lagrange, 
and Hamilton would indeed move in an elliptical orbit; but an ellipse in 
itself has no obvious connection to the repeating properties of the Peri- 
odic Table. Changes in the energy and angular momentum of the elliptical 
orbit are not reflected in any obvious way with changes in the chemical 
properties of the elements. The energy and the angular momentum change 
continuously. But the chemical shifts are in discrete jumps from element 
to element. Shifts in the number of electrons elliptically orbiting a nucleus 
also do not in themselves explain the dramatic shifts of chemical properties 
and repeating cycles which the chemical elements exhibit. Structured as 
the ellipse might be, it does not possess a microscopic richness that flowers 
with the diversity of the elements. 

Second, such a miniature solar system is unstable. Electromagnetic the- 
ory predicts that an orbiting electron should radiate energy. As the electron 
loses energy by radiation, its orbit should spiral inward to the nucleus in 
an ultimate collapse of the atom. The Newtonian mechanics of a charged 
nucleus with an electrically bound electron yields an inherently unstable 
atom. 

Numerous experiments of the early twentieth century substantiated the 
existence of the nucleus and the electron and verified that the binding force 
between these two particles was indeed the electrical inverse-square force. 
Unfortunately, when these two particles are allowed to interact according 
to the laws of mechanics revealed by Newton, Lagrange, and Hamilton, 
they yield the family of conic sections — dynamical structures that cannot 
explain the behavior of the elements. 

Niels Bohr resolved this contradiction by proposing that at the micro- 
scopic level of the atom one must regard the motion of a point particle 
orbiting another particle as a wave. A wave arrayed around a closed orbit 
cannot close on itself unless it consists of an integer number of wavelengths. 
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The length of the orbit must therefore be an integer multiple of a funda- 
mental length (fixed by a new constant of nature first discerned by Max 
Planck). The orbit is quantized : the states of motion of the electron are 
discrete waves denumerated by integers. 

What Kepler did for classical mechanics with the ellipse, Bohr did for 
quantum mechanics with the quantized orbit. As Newton would subse- 
quently produce the Kepler ellipse from a more fundamental law of motion, 
Wolfgang Pauli would subsequently produce the Bohr orbit from the more 
fundamental laws of quantum mechanics established by P. A. M. Dirac 
and Werner Heisenberg. Overshadowing these developments in the law of 
motion are the symmetries of two-body motion with gravitational and elec- 
trical forces. The eccentricity invariant of Jacob Hermann persists from the 
classical realm into the quantum realm where Wolfgang Pauli would reveal 
its quantum form. The symmetries of four-dimensional rotations play a 
central role in both the classical and quantum motion of two bodies. 



Particles and Waves: Dichotomy and Unity 

At first blush, particles and waves seem contradictory manifestations 
of objects such as electrons and protons. It does not seem possible that 
an electron can be both a point particle and a wave. The resolution of 
this dichotomy hinges on a proper recognition of the importance of scale 
in determining which kind of properties the electron can be expected to 
exhibit. The length scale of a particle such as an electron is its wavelength. 
When it interacts with objects whose length scale is much longer than its 
wavelength, it acts like a point particle. When it interacts with objects 
whose length scale is comparable to or shorter than its wavelength, it acts 
like a wave. 

An elementary wave consisting of a single frequency to and traveling in 
the direction of the wave vector k may be represented as a wave function, 

4/(x,t) = Ae i(kx ~ ut \ (5.1) 

where A is a real amplitude which specifies the strength of the wave. In 
the case of the waves of classical motion — water waves, sound waves, light 
waves, even gravity waves — the complex nature of the wave function is 
purely a mathematical shorthand. The physically observable properties of 
the wave are obtained by taking either the real or imaginary part of the 
wave function such as 3?(4') = A cos(k ■ x — ut) or 9(T) = A sin(k ■ x — cot) . 

But in the case of quantum waves, the wave function has a radically 
different reality. In contrast to either the physically observable 3?(T) or 
3(4') of a classical wave, the physically observable aspect of a quantum wave 
combines both the real and imaginary parts of the wave. It is the amplitude 
iTj = ^^(T) 2 + 3(4') 2 ] = A which is the quantum observable, a telling 
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departure from the behavior of classical waves. This means there is always 
an inaccessible phase factor in a quantum wave that is not available to us. 
This hiddenness within quantum waves underlies the deepest aspects of 
the quantum world, including the famous Heisenberg Uncertainty Principle 
which tells us the act of observing one property of a quantum particle, 
such as its position, can render other properties of the particle, such as its 
momentum, unobservable. 

Under classical conditions one says a particle is located at the point 
x. Under quantum conditions, one says the particle is located somewhere 
within the region where the amplitude of its wave function j *F (x) | has sig- 
nificant value. The point x of a classical particle is replaced by the “smear” 
j*F(x)| of a quantum particle as shown in Fig. 5-1. The |’P(x)| smear is di- 
rectly related to the probability of finding the particle at the point x. (The 
probability density of the particle will turn out to be the square amplitude 
j*Fj 2 as we shall find below.) The particle has no likelihood of being in re- 
gions where |'P(x)| 2 vanishes and a very great likelihood of being in regions 
where j'P(x)| 2 has large values. Thus, in the quantum world, the point lo- 
cation of a particle is replaced by a distribution throughout space, the wave 
function T, or as it has come to be known, a quantum field. The spread 
of the “particle” throughtout space in a field gives the particle wave-like 
properties. 



-» —a: 




(b) 



Figure 5-1. Classical Point Particle and Quantum Wave. The classical point x of a 
particle’s position (a) becomes the quantum smear 'I' ( x ) (b). 

This idea is more precisely illustrated in Fig. 5-2. For particle wave- 
lengths short compared to the scale of the objects with which it interacts, 
the point properties of a quantum particle are pre-eminent. The wave-like 
oscillation of the particle cannot be detected when it passes through a slit 
which is long compared to its wavelength. The particle acts as if it were 
flying freely through an open window and strikes the target with the preci- 
sion of its original aim. The particle, even though it be a wave, acts in this 
situation just as if it were a point. 
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(a) 




Figure 5-2. Particle-Wave Duality. At wavelengths short compared to the slit with 
which it interacts (a), the particle appears as a point flying freely through the slit. But 
at wavelengths A comparable to or longer than the slit width l the particle appears as a 
wave and is diffracted by the slit (b), (c). 



But for particles whose wavelength becomes comparable to or longer 
than the length scales of the objects with which it interacts, the wave nature 
becomes pre-eminent. Even if it is aimed precisely at the center of the slit, 
the particle does not fly freely through it. The particle waves collide with 
the slit walls causing it to be diffracted and its location is made uncertain. 
It no longer lies on the straight-line trajectory through the center of the 
slit, but is distributed over the target. 

If a particle is to be identified with a wave represented by the wave 
function T, the wave function must, in the classical limit, reproduce that 
mechanical quantity embodying its full dynamics, its action S. The action 
of a motion is equal to the sum of the actions of its independent parts. The 
wave function of a motion 4'(x, t ) , however, represents a probability for the 
presence of the particle at a particular point. The probability of a sum 
of independent parts is the product of the probabilities of the parts. The 
relationship between the wave function 4' and its corresponding action S is 
therefore a relationship between functions that are multiplicative on the one 
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hand and additive on the other. The transition from classical mechanics 
to quantum mechanics can therefore be made near the classical limit by 
representing the wave function as an exponential of the action, 

T = \y\e iS/R , (5.2) 

where T is the amplitude of the wave function and h is a constant of 
nature required to render the wave function dimensionless. The action has 
dimensions length 2 x time and the dimensions of the constant fi , which 
arises in the identity between particles and waves, must likewise have the 
dimensions of length 2 x time, the dimensions of action. 

The identification of Eq. (5.2) with Eq. (5.1) reveals that the action is 
represented in terms of wave vector and frequency as S = h(k ■ x — tot) and 
has the derivatives 



dS/dx = hk, dS/dt = — hu>. (5.3) 

But the space and time derivatives of the action are the momentum and 
energy as given by Eqs. (3.26): 

dS/dx = p, dS/dt = -H. 

The wave vector and frequency of a wave are therefore given in terms of 
the momentum and energy of a particle by 

k = p/h , u> = H/h. (5.4) 

Equation (5.4) embodies the duality between particles and waves. Dynam- 
ical quantities such as p and H are equivalent to the wave vector k and 
frequency u> of a wave. 

The quantum revolution can be regarded as a transfiguration of the clas- 
sical in which the phase flow, rather than consisting of particle streamlines, 
consists of waves which both constructively and destructively interfere. Me- 
chanical quantities which are properties of the flow such as momentum, 
angular momentum, energy, eccentricity, and so on are no longer regarded 
as ordinary algebraic functions of space-time points. Rather, they are re- 
garded instead as more creative mathematical objects which will manifest 
themselves in waves. Such wave-like mechanical objects are known as op- 
erators. 

An operator is a supple mathematical creation. It has the capacity to 
generate both wave-like objects as well as those which compact down to 
a point. An operator is not the measurable value of the physical quantity 
it represents in the way classical mechanical variables take the measurable 
values of quantities they represent. Instead, operators generate a whole 
spectrum of values unique to a particular mechanical quantity which oscil- 
late, decay, or grow like interacting waves of light or sound. A measurable 
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value of a physical quantity turns out to be one of the values in the spec- 
trum. 

The nature of an operator may be established by noting that the wave 
function (5.1) reproduces itself upon differentiation with coefficients which 
are its wave vector and frequency: 

W/dx = iky, d^/dt = —iu'b. (5.5) 

On making the identification of (k, u>) with (p, H) in Eqs. (5.4), the forms 
in Eq. (5.5) can be expressed as 

(—ihd/dx)^ = pT, (ihd/dty k = H\ E (5.6) 

These identities introduce dynamical operators: 

p = — ihd/dx , H = ihd/dt, (5.7) 

in terms of which they become 

pT = pT, HV = H^. (5.8) 

The wave-particle duality leads us to the remarkable identities (5.8). 
These austere mathematical relationships — uncanny in their simplicity and 
mesmerizing in their iconography — tell us that an operator such as p ap- 
plied to a function T yields precisely that same function with a coefficient 
which is the observed value p of that dynamical quantity. Moreover, al- 
though the fundamental operators (5.7) were discovered by making connec- 
tion with the classical limit, they prove to be universally valid for the full 
range of quantum behavior. 



The Quantum Rendering of Mechanics 

Particles beget mechanical quantities which behave as algebraic vari- 
ables. Waves beget the same mechanical quantities but they behave as 
operators. When particle wavelengths become short compared to the scale 
of motion, quantum operators give results like classical mechanical quanti- 
ties. Quantum waves settle down into streamlines and become the classical 
phase flow. One may therefore regard quantum mechanics as the funda- 
mental theory of motion which contains classical mechanics as its limit at 
large quantum wavelengths. The quantum world of the atom therefore ap- 
propriates the classical mechanics of Chapters 3 and 4 with the simple, if 
epochal, change that mechanical quantities become operators. 

Thus, to every mechanical quantity A (which may represent momentum, 
angular momentum, energy, eccentricity, and so on) there corresponds an 
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operator A. A dynamical quantity and its operator are linked by the fun- 
damental identity connecting particles and waves for which Eqs. (5.8) are 
the prototypes: 

AT = AT. (5.9) 

The classical description of motion involves only one thing: a mechan- 
ical quantity A which is observed. The quantum description of motion 
involves three things: the wave function T, a mechanical quantity A, and 
its operator A. The wave function T represents a state of nature. The 
operator A acts upon T and represents the process of observation of na- 
ture. The observable A is the result of that process of observation. These 
three quantities — the state of nature T under observation, the operator A 
representing the observation, and the observable A representing the result 
of that observation — are intimately linked by the fundamental identity of 
quantum mechanics (5.9). 

The essence of the quantum is contained in the remarkable mathematical 
properties of the quantum identity AT = AT. This is an unusual kind of 
equation; for T appears linearly on both sides but A and A are distinctly 
different mathematical objects. The identity cannot be satisfied by any 
function T (other than the trivial one T = 0) unless A has rather particular 
values. Although the quantum identity appears formally to be an equation 
for T, it is an equation for both T and A. This mathematical property is 
precisely what is required physically. For it is the value of the observable A 
which one set out to determine in postulating the identity between particles 
and waves. 

The quantum identity of an operator is generally satisfied by not one 
but many values of T and A (often infinitely many). This is certainly 
required physically; for these are the observable values and one knows that 
mechanical quantities generally take a multitude of values. Each of the 
values A is called an eigenvalue of A (after the German word eigen , meaning 
special) and the corresponding function T that goes with that eigenvalue 
is called an eigenstate or eigenfunction of A. 

The fact that the quantum identity is satisfied only for special values of 
an observable shows that nature need not be present to us on the “full real 
number line” of classical physics. Whereas a mechanical quantity may take 
any value on the real number line in classical mechanics, it may only take 
the eigenvalues of its operator in quantum mechanics. These eigenvalues 
are not generally dense on the real line. The angular momentum operator, 
as a leading example, possesses discrete rather than continuous eigenvalues. 

Discrete spectra signal a qualitative reduction in the permissible values 
for mechanical quantities — a countable number of eigenvalues denumerated 
by the integers rather than the uncountable number of values on the real 
number line permitted in classical motions. The existence of discrete spec- 
tra for quantum observables is the origin of the term “quantum.” Clas- 
sically, we have the illusion of the full real number line because discrete 




146 



The Shaggy Steed of Physics 



quanta blur together giving the illusion of a continuum on classical length 
scales. 

Let us illustrate the operators for the most important mechanical quan- 
tities. The operator for position in space is postulated to be identical to its 
observable: 

x = x. 

On the other hand, Eqs. (5.7) indicate that the operators corresponding to 
the momentum and energy, p = —ihd/dx and H = ihd/dt, as well as the 
angular momentum, 

M = —ihxxd/dx, 

are differential operators. More complex quantities which involve position, 
momentum, and energy may be built up from the momentum and energy 
operators. 

The operator appropriate to the kinetic energy involves the squared mag- 
nitude of the momentum. For a single particle of mass m the kinetic energy 
operator is T = p 2 /2m. The operator p 2 is 

p 2 = -h 2 (d 2 /dx 2 + d 2 /dx 2 2 + d 2 /dx^ 2 ) . 

The quantum kinetic energy operator, aside from constant factors, is the 
Laplacian p 2 = — h 2 \7 2 = —h 2 d 2 /dx 2 . 

The action S is the fundamental mechanical quantity describing classical 
motion and the Hamilton- Jacobi equation which governs it is the master 
equation governing classical motion. What is the corresponding equation 
for the action in the quantum realm? It is the Sclrroedinger equation — the 
quantum identity for the energy. This equation results when one writes 
H = ihd/dt on the one hand, substitutes the Laplacian p 2 a = ~h 2 \7^ in 
H = T + V = X) a Pa/2 m a + V(x a ) on the other, and equates the results: 

ih^- = -^(h 2 /2m a )V 2 a 'l> + V(x a )^. (5.10) 

Ot 

The Sclrroedinger equation is the master equation of quantum motions, 
the quantum transfiguration of the classical Hamilton- Jacobi equation. 
The Hamilton- Jacobi equation issues from the Sclrroedinger equation in 
the classical limit. To see this connection, represent the wave function in 
the Sclrroedinger equation (5.10) in the form rk = |'F|e*' s / ,i , with S regarded 
as a real quantity that need not , in general, be the classical action. It only 
becomes so in the classical limit, as we shall now demonstrate. With the 
wave function expressed in terms of phase and amplitude, the Sclrroedinger 
equation elegantly produces two equations governing S and jTj.* The real 



* In deducing Eqs. (5.11) and (5.12) from (5.10), observe that by taking the gradient 
of = \^/\e lS ^, the gradient of the phase S is completely expressible in terms of the 
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part of the Sclrroedinger equation describes the evolution of the phase S: 

f ic 

— + £(1/2 m a )V a S-V a S + V(x a ) - (tf/m £ V Q 2 jT| = 0. (5.11) 

a. at 

The imaginary part evolves the square of the wave function amplitude 
|T| 2 = TT* which will turn out to be the probability density of the quan- 
tum field: 



J^l 2 + 5] - -rv Q f ) = o. (5.12) 

ot a 

It is noteworthy that the quantum effects indicated in the term con- 
taining h are proportional to h 2 . Thus, the real part of the Sclrroedinger 
equation (5.11) reproduces the classical Hamilton- Jacobi equation in the 
classical limit Ti — > 0 with quantum effects not appearing until the level of 
Ti 2 . In the limit of classical correspondence one sees that quantum phase 
corresponds to the classical action; but the amplitude of the wave function 
and the imaginary part of the Sclrroedinger equation have no classical cor- 
respondence. They describe purely quantum phenomena which lose their 
physical significance in the classical limit. The real part of the wave func- 
tion, however, does not lose its significance in the classical limit. It is the 
classical action. 

Action, at a deep level, is phase. This is to be expected; for at root 
the action represents the distance a particle travels along a path. If the 
particle is replaced by a wave (or an ensemble of waves), the position along 
alternate crests and valleys of a wave are described by its phase. Since the 
wave is arrayed along a path, it is appropriate that the distance traversed 
along the path becomes the phase of the wave. 



The Quantum Field and Quantum Probability 

Quantum mechanics preserves the classical invariants of motion. But 
quantum mechanics also opens us to new invariants unique to the quantum 



gradient of the wave function and its conjugate, 

|*| 2 V a S = Vctf), 



and that 



V Q 2 * = (i'I'/?i|^| 2 )V Q -(|^| 2 V a S) - (*/h 2 )V a S-V a S- (^/|*|)V q 2 |*|. 
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realm. The premier invariant of quantum motion has no classical counter- 
part. It springs from that uniquely quantum creation, the wave function *L 
itself. It is the probability that nature will be found in a state with precisely 
this wave function. Let us see how it comes about that the probability of a 
state is the central invariant of quantum motion. 

An invariant is the manifestation of an underlying symmetry. The fun- 
damental quantum symmetry is the openness of nature to quantum waves 
of all phases. Just as there is no unique origin for classical point particles, 
there is no unique phase for a quantum wave. The state *L differs from the 
state 

<L' = e iA T 

by the phase A, the symmetry parameter of the transformation. These two 
states cannot be distinguished since 'I-'' = j>Lj and it is this quantity (not 
T itself) which is observable. 

Quantum mechanics reveals nature’s fundamental physical reality is the 
quantum field *L, not a point particle’s coordinates. The particle is a man- 
ifestation of a sharply localized field in the classical limit. But this field is 
not directly observable. It is the amplitude I'Ll which is observable. The 
inaccessible phase therefore manifests a certain fundamental indeterminism 
in the quantum world. Although the full details of quantum waves are not 
open to us, statistical information about these motions is available. The 
square amplitude I'Ll 2 turns out to be the bearer of this statistical infor- 
mation and has become to be known as known as the probability density 
of the field. The probability density may be used to build up statistical 
information about the inaccessible motions. The most fundamental source 
of information on statistical processes is the expectation value. It represents 
the average of a run of observations one makes on a variable which is ob- 
served imperfectly. The expectation value of the position x of a particle in 
a state >L is 

(x) = j 'S>*x'S>d 3 x. 

The general rule for finding the expectation value of any mechanical quan- 
tity A is postulated to be 



{A) 



'LM'Ld 3 :*:. 



(5.13) 



This connection was first discerned by Max Born in 1924; and it is the 
feature of quantum mechanics Einstein accepted only tentatively and whose 
overthrow he awaited to the end of his life. Perhaps Einstein will in the 
end be vindicated by a new mechanics which captures the wave nature 
of matter with the spirit and continuity of classical mechanics. But it is 
more likely that the next mechanical revolution will flow from new, more 
refined ideas about uncertainty and determinism on both the classical and 
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quantum level. Einstein, whose reverence for the truth was constant, might 
rejoice in such a theory. We shall return to this theme in Chap. 7. 

The observed values (A) of quantum-mechanical operators A are real 
quantities. This means (A) = (A)*-, so Eq. (5.13) shows that quantum 
operators must satisfy 



^*A^d 3 x = / ^A*ty*d 3 x, 



a restrictive condition that operators, in general, do not satisfy. Let T and 
4> be any two wave functions operated upon by the same operator A. An 
operator A is said to be Hermitean if it satisfies 



J d’A'I' d 3 x 



d 3 a:, 



where the integrals are over all space. Hermitean operators give the same 
results when the functions upon which they operate are interchanged and 
the complex conjugate is taken. We see that by letting 4> = T*, the Her- 
mitean requirement is precisely what is required for quantum mechanical 
oerators if their observables are to be real quantities. 

All quantum-mechanical operators must be Hermitean. It may be di- 
rectly shown that the fundamental dynamical operators x, p, H , and M 
are all Hermitean. But operators that must be built up from them are 
not necessarily Hermitean when the mechanical variables are simply tran- 
scribed into operator notation according to the rules H — > ihd/dt and 
p —> —iUd/dx. A noteworthy example is the eccentricity 

e — p x M/km — r/r. 

The quantum-mechanical operator for e cannot be 

e = p x M/km — r/r 

because (surprise) p x M is not Hermitean. 

What is the axiomatic prescription for constructing operators which are 
built up as functions of the elementary dynamical variables? Remarkably, 
save for the requirement that they be Hermitean, there is none. There is 
an art involved in the formation of operators which are built up from the 
fundamental dynamical quantities x, p, t, and H. The artistic openness of 
quantum mechanics is another manifestation of an incompleteness in the 
theory. It is possible that the artistic freedom which requires us to frame 
anew each unique description of the microscopic world in quantum form is 
inherent in our interaction with nature. 

The invariant aspect of the probability density I’Ll 2 may be drawn out 
of the imaginary part of the Schroedinger equation (5.12). One discovers 
that this result is a conservation law, the probability continuity equation , 

d\*\ 2 /dt + ^V a -J a =0, 

a. 



(5.14) 
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where J a is the probability current , the flux of the probability density of 
particles of mass m a through a surface of unit area normal to J a : 

xTi 

Ja = x (5-15) 

2m a 

One may regard the current of any physical quantity as the product 
of the density of that quantity and its transport velocity. The probability 
current is similarly expressible in terms of a transport velocity — the particle 
velocity operator x = p/m = (—ih/m)V . It has the form 

j Q = ($i Q r + ri^)/2. 



The transport velocity for the probability is the Hermitean average of the 
particle velocity operator and the density TT*. 

The total probability contained in some region of space is j \^\ 2 d 3 x 
where the integral extends over the region of interest. The continuity equa- 
tion for the probability density implies that this quantity is an invariant for 
a bound system of particles. This can be seen by direct calculation using 
the continuity equation (5.14): 

j t J 1 4' | 2 d 3 x = j d jjfd 3 x=-Y J j V a -J a d 3 x. (5.16) 



The volume integral (5.16) may be transformed into an integral over the 
surface enclosing the particles, 



d_ 

dt 



/ 



|T | 2 d 3 x 




(5.17) 



where n is the outward-directed normal to the surface. The probability cur- 
rents J a and, accordingly, the right-hand side of Eq. (5.17) vanish outside 
the region of the particles. The total integrated probability is therefore an 
invariant: 

j t j iTj 2 d 3 x = 0. (5.18) 

A system of bound particles can be located with certainty within the 
entire space of their motion. If a probability value of unity represents 
certainty, then the probability density for bound particles must satisfy the 
condition 

J \v\ 2 d 3 x = i, («»> 



where the integral is over all space.* 



* In the case of particles free to move off to infinity, there does not exist a well-defined 
total integrated probability with which the state functions can be normalized such as that 
given by Eq. (5.19). This is because the probability current and the surface integral in 
Eq. (5.17) do not vanish at infinity where particles are present. Although an invariant like 
J |4/| 2 d?x for the total probability over all space does not always exist, the probability 
density always satisfies the continuity equation (5.14); and it is the continuity equation 
which is the more fundamental expression of the conservation of probability. A local 
increase in the density d\^\ 2 /dt comes at the expense of the current in the divergence 
Vq; ' Ja- 
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A variety of mechanical operators are appropriate to every physical sit- 
uation and each operator has a multiplicity of eigenstates 'h 1/ . The state 
function describing that situation generally contains all these eigenstates; 
for nature is rarely in a pure state, that is, one in which T = T,, for one par- 
ticular v of one particular mechanical quantity. Rather, the state function 
of a generic motion is a composite of all eigenfunctions, 

V 

where v now runs over all the eigenfunctions and the a v are weights that fix 
the amounts of each eigenstate to be superposed in the ensemble so as to 
describe the particular physical circumstance in which nature finds itself. 

For a system of localized particles, these eigenfunctions satisfy the nor- 
malization and orthogonality condition* 

j ^„%d 3 x = 5^. 



The orthonormality condition distinguishes the various eigenstates guaran- 
teeing that they are independent; for if f d 3 x ^ 0, then T„ and 

are not distinct states. 

The appropriateness of the expectation postulate (5.13) can be seen as 
follows. If a particle happens to find itself in a pure eigenstate of A, say T„ 
with eigenvalue A„, then A is a fully accessible variable since AT,, = A„T„ 
and Eq. (5.13) yields (A) = A„. 

On the other hand, if (as is more often the case) the particle is in a state 
that is not a pure eigenstate of A, then its state T consists of a superposition 
of all the eigenstates T„ of A: T = y) 1 ,a„T„. The expectation value will 
then not be a single eigenvalue of A but an average over many of them: 

<A> = 5>„| 2 A„. 

V 

From this it follows that the squares of the weights, |a„| 2 , represent the 
probabilities that the particle is in the state v. A mechanical quantity 
which approaches complete accessibility will have a sharply peaked prob- 
ability density about that accessible eigenvalue (the |a„| 2 of the accessible 



* The orthonormality condition may be verified by multiplying A^f v = by \&* 

and the complex conjugate A*\I/* = by \Ev, integrating over all space, and sub- 

tracting the results to produce 




V v d 3 x 



<1 QA*^d 3 x = (A v 




V v %d 3 x. 



Since A is Hermitean, the left-hand side vanishes and orthonormality follows. 
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eigenvalue approaches unity and all the other a„ vanish). On the other 
hand, a quantity which is not fully accessible will have probability density 
functions with appropriately less degrees of sharpness. 



The Algebra of Operators 

In the classical description of motion one uses ordinary algebraic relations 
between mechanical quantities. For example, the component IW 3 of the 
angular momentum is equal to (xiP 2 ~ X 2 Pi). It makes no difference whether 
one writes the products of x and p as X 1 P 2 or P 2 X 1 . This property of 
ordinary algebraic variables is that of commutativity. In ordinary algebra 
the variables in a product commute: AB = BA. In quantum-mechanical 
algebra, dynamical operators do not necessarily commute: AB 7 ^ BA. 

The failure of quantum operators to commute is rooted in the wave 
nature of the mechanics they describe. The operator AB corresponds to 
an observation of B followed by an observation of A. The operator BA 
corresponds to an observation of A followed by an observation of B. The 
commutator ( AB — BA) describes the interference of one observation with 
another. If ( AB — BA) = 0, nature will be in identically the same state 
if the observation of A follows that of B or that of B follows that of A. 
On the other hand, if ( AB — BA) 7 ^ 0, then A and B are mechanically 
so intertwined that the act of observing one of them interferes with the 
observation of the other. 

The duality between particles and waves brings a non- vanishing commu- 
tator into mechanics — a quantity that never arises in the classical regime 
revealed by Newton, Lagrange, and Hamilton. Nonetheless, this quantum- 
mechanical quantity is deeply rooted in classical mechanics. In a brilliant 
proposal (Note 7) which utilizes only the properties of Poisson brackets, 
P. A. M. Dirac suggested that the commutator of dynamical quantities is 
complex, proportional to h, and corresponds to the Poisson bracket of these 
quantities: 

(AB — BA) = ih[A, B] . (5.20) 

In the classical limit in which h — > 0 the commutator of mechanical 
quantities vanishes no matter the value of the Poisson bracket. This is 
consistent with the classical situation: operators become the observables 
themselves and the algebra becomes ordinary. Observation of one quantity 
has no effect on the observation of any other. 

The Dirac postulate connecting the Poisson bracket with the commutator 
of dynamical variables is the central bridge that carries classical mechan- 
ics into quantum mechanics. It directly reveals those dynamical variables 
for which wave-like interference is inherent. For if the Poisson bracket 
of a particular set of mechanical quantities does not vanish, neither does 
the quantum commutator of the operators representing those quantities. 
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Dirac’s postulate (5.20) is the touchstone between the classical and quan- 
tum realms — and perhaps the most beautiful mathematical statement in 
quantum mechanics. 

The most elementary non- vanishing quantum commutator is that of the 
canonical coordinates: position and momentum. Since the classical Pois- 
son bracket was shown in Chap. 3 to have the value [qi,Pj\ = Sij, the 
fundamental quantum commutator is 

(QiPj - PjQi) = ihSij. (5-21) 

The two primal vectors of motion — position and momentum — can never 
be simultaneously observed on the quantum level. Every canonical pair of 
coordinates possess a Poisson bracket which is unity. Canonical coordinates 
are so mechanically entwined by their symplectic union that they necessarily 
interfere with the observation of one another. Equation (5.13) presents us 
with one of the most important revelations of quantum mechanics: the 
uncertainty principle. It tells us that only half the canonical coordinates of 
a quantum particle may be exactly observed. 

If two quantities are to be simultaneously observed when the ensemble 
is in a given eigenstate, it is obvious that the operators corresponding to 
these quantities must have this eigenstate in common. Conversely, if two 
operators have the same eigenstate in common, the quantities which they 
represent may be simultaneously observed. Simultaneity of observation and 
commonality of eigenstates imply and are implied by one another. 

The operators of simultaneously observable quantities also commute. 
The operator equation for an observation of A followed by an observation 
of B for an ensemble in an eigenstate T of A is 

BAT = BA T. 

If A and B commute, then the identity can be written 

A(B'ft) = A(B'S'). 

But this requires that BT is T itself save for a multiplicative factor, or 
BT = BT. The multiplicative factor is therefore an eigenvalue of B and T 
must be an eigenstate of B as well as A. 

The nature of observation in the quantum realm may be summed up 
in the following principle: simultaneous observation of mechanical quanti- 
ties requires that their operators commute and that these operators possess 
eigenstates in common. 

The Achilles Heel of Quantum Mechanics 

The probabilistic nature of quantum mechanics is both its sine qua non 
and its Achilles heel. The state function is governed by the Schroedinger 
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equation and the probability embodied in |4'| 2 propagates according to 
the continuity equation (5.15) derived from it. These equations are fully 
deterministic! How then does probability enter? 

To answer this question one must recall that the probabilistic structure 
of quantum mechanics stems from the indeterminacy of quantum motions: 
the impossibility of observing half the physical quantities of nature exactly. 
The state function and the Schroedinger equation are ready-made quan- 
tum vehicles for propagating indeterminacy through space and time given 
an appropriate initial condition (these equations are first order differen- 
tial equations in time); but they in themselves cannot set up the initial 
condition that describes a particular mix of indeterminacy. There are no 
equations in quantum mechanics which can do this. 

The most useful way found thus far to complete the probabilistic struc- 
ture of quantum mechanics was advanced by Bohr and has come to be 
known as the “Copenhagen interpretation.” In it, probability is introduced 
through the act of observation — an activity which lies outside the formal 
mathematics of the theory. Probability in the present theory of quantum 
mechanics enters at the boundary of its mathematical structure as the initial 
conditions of the equations of motion. This happens every time one makes 
an observation. The evolution of nature governed by the Schroedinger and 
continuity equations is interrupted whenever an observation is made. (The 
state function is said to “collapse” at the instant of observation.) New 
initial conditions are simultaneously established by the act of observation. 
This new state function possesses a mix of indeterminacy different from 
that of the old. Some information is therefore lost; and it is through this 
loss that probability enters the theory. Nature then evolves from these new 
initial conditions until the next observation. 

Inadequate as this procedure may be from the viewpoint of the integrity 
of the theory, it has proved extraordinarily successful in dealing with the 
quantum mechanics of atoms, molecules, light, and nuclear particles. As 
long as one remains in the quantum realm, the Copenhagen interpretation 
works successfully. It is only when one attempts to leap from the quantum 
world into the classical world that paradox presents itself.* 

Symmetry and Invariants in Quantum Mechanics 

Symmetry, invariants, and degeneracy are profound aspects of mechan- 
ics. These features of motion are universal for both the classical mechanics 
of heavenly bodies and the quantum mechanics of the elements. While 
quantum mechanics introduces a new invariant unique to the quantum 



* Interested readers may wish to examine these paradoxes in J. S. Bell, Speakable and 
Unspeakable in Quantum Mechanics, Cambridge: Cambridge University Press, 1987 and 
J. A. Wheeler and W. H. Zurek (eds.), Quantum Theory and Measurement, Princeton: 
Princeton University Press, 1983. 
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world, the probability f j4'| 2 d 3 x, it also preserves the invariants and sym- 
metries of the classical world. Indeed, quantum mechanics reveals the even 
more vital role symmetry plays in our interaction with the world. Clas- 
sically we believe we observe the world quite independently of whatever 
symmetries are present. Quantum mechanics tells us that the very act of 
observation rests upon the symmetries with which nature presents us. 

How is observation dependent upon symmetry? The simultaneous ob- 
servation of any set of physical quantities requires that their operators com- 
mute or, classically, that they have vanishing Poisson brackets. But these 
are the Casimir sets of mechanics. Aspects of nature which can be simul- 
taneously observed must possess an invariance with respect to a particular 
symmetry of nature. The question of which quantities are observable with- 
out quantum uncertainty has therefore already been answered classically: 
they are the Casimir sets. 

The two Casimir sets of translational symmetry consist of all the ele- 
ments of the translation group itself, the components of P (or AT). All 
the components of P (or N) are therefore simultaneously observable. The 
Casimir set of rotational symmetry consists of the pair (M, M 3 ). It tells us 
that different angular momentum components may not be simultaneously 
observed. On other hand, any single component, say M 3 , and the total 
magnitude M commute and may be simultaneously observed. The simul- 
taneous observation of these two quantities clearly fixes the total magnitude 
of the remaining two components. But the manner in which that magnitude 
is split between these two components is inaccessible. 

One knows the Casimir sets are the quantum observables, but one ex- 
plicitly knows only the Casimir sets of the fundamental symmetries: trans- 
lations in space and time and three-dimensional rotations. There remain 
unknown invariants for general motions since the additional symmetries 
beyond the fundamental ones are unknown. There is one motion for which 
all the Casimir sets may be explicitly exhibited. This is the motion of two 
bodies; and it will shortly become the center of our attention. 

The manner in which invariants and degeneracy manifest themselves in 
quantum mechanics is different from their manifestation in classical me- 
chanics. In classical mechanics motion consists of the trajectory flow of 
particles in phase space. Each isolating invariant introduces a degeneracy 
into the motion in which the full phase space available to the trajectories 
degenerates into a submanifold. Increasing numbers of isolating invariants 
correspond to increasing degeneracies of the motion which restrict the tra- 
jectories to increasingly restricted submanifolds of phase space. For the 
maximum possible number of isolating invariants, the motion is completely 
degenerate resulting in a closed orbit for bound motion. 

In quantum mechanics it is not possible to observe trajectories. One 
observes instead the eigenvalues of the operators representing mechanical 
quantities. The phase space of the classical world is replaced by two spaces 
in the quantum world: the state space spanned by the eigenfunctions T,, 
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and its corresponding eigenvalue space spanned by the eigenvalues A v . It 
is these spaces that particles are free to explore in quantum motion. An 
invariant on the classical level leads to a degeneracy of the phase space. 
An invariant on the quantum level leads to a degeneracy in the state and 
eigenvalue spaces. Whereas a degeneracy on the classical level is manifested 
in a collapse of the general manifold in which the particle trajectories may 
lie to a submanifold, a degeneracy on the quantum level is manifested in a 
collapse of the general manifold of eigenvalues in which the ensemble may 
reside to a submanifold. As a result, different eigenfunctions, rather than 
corresponding to uniquely different eigenvalues, will correspond to the same 
eigenvalue. Roughly, one may say that a manifold of classical trajectories 
isolated by a degeneracy to the same orbit become quantum states isolated 
by that degeneracy to the same eigenvalue. 

Any invariant and the total energy necessarily have a vanishing Pois- 
son bracket and, by Dirac’s postulate, a vanishing quantum commutator. 
A vanishing quantum commutator with the energy is therefore connected 
with each invariant and its corresponding degeneracy. On the basis of the 
quantum identity (5.9) and Dirac’s postulate (5.20), it follows that any two 
invariants (e.g., the angular momentum components Mi, M 2 ) which each 
commute with the energy H but do not commute with one another possess 
distinctly different eigenstates which correspond to the same eigenvalue of 
the energy. Let 'Ll and T 2 be the eigenstates of Mi and M 2 . Since Mi and 
M 2 commute with the energy H they are also eigenstates of H and satisfy 

H'S>i = H'f>i, M 1 'L 1 = Mi’Ll, 

H 'L 2 = iDL 2 , M 2 'L 2 = M 2 'L 2 . 

But since Mi and M 2 do not commute, their eigenstates must be distinct: 
M 1 'L 2 yf MitL 2 , M 2 'Li y^ M 2 'L 1 . 

Further, each additional invariant that does not commute with any other 
invariant introduces a new level of degeneracy in which the additional eigen- 
states of these invariants again correspond to that same eigenvalue of the 
energy. The number of additional eigenstates for each eigenvalue depends 
upon the degree of degeneracy or, equivalently, the number of additional 
isolating invariants. These additional eigenstates are necessary to describe 
the additional invariants. But since any invariant commutes with the en- 
ergy, these eigenstates all correspond to the same eigenvalue of the energy. 

Our paradigmatic example is the motion of two bodies bound by the 
inverse-square force. The relative motion of these two bodies possesses a 
full set of isolating invariants: it is completely degenerate. In the motion of 
two heavenly bodies complete degeneracy is manifested in isolating invari- 
ants consisting of the energy, angular momentum, and eccentricity. These 
invariants generate a closed orbit lying completely within a single plane. 
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In the quantum motion of the electron about a nucleus complete de- 
generacy is also manifested in a complete set of identical invariants. But 
rather than generate a closed orbit, the operators corresponding to these 
isolating invariants generate multiple eigenstates of angular momentum and 
eccentricity which correspond to the same eigenvalue of the energy. 

In the quantum regime the dynamics of particles are no longer expressed 
in trajectories. Instead, particle dynamics are expressed in the eigenstates 
and eigenvalues which the ensemble may assume. As the determination 
of the trajectories of particles is the essence of classical mechanics, the 
determination of the eigenstates and eigenvalues of the particle ensemble is 
the essence of quantum mechanics. 

How can one find the quantum states that a system of particles may as- 
sume? Quantum states are the solutions — the eigenvalues and eigenstates — 
of the quantum identities (5.9) for any dynamical quantity of interest. For 
most quantities other than x and t these identities are differential equa- 
tions. Quantum states — eigenvalues and eigenstates — are the solutions of 
these differential operator equations. 

The states of the translational invariants whose Poisson brackets and 
quantum commutators vanish are completely accessible and the eigenvalues 
of their operators form a continuous spectrum. The Poisson bracket and 
quantum commutator of the linear momentum vanishes: [Pi, Pj\ = 0. The 
eigenvalues of each component Pi form a continuous spectrum — oo ^ P, ^ 
+oo. This is also true for the mass-center invariant N. The components 
of N are all simultaneously observable and their eigenvalues also form a 
continuous spectrum. 



Quantum Rotational Symmetry 

The states of the angular momentum M are different from those of P 
and N since different angular momentum components do not commute. 
Angular momentum commutes according to Dirac’s postulate (5.20) as 

(MiMj - MjMi) = ihe ijk M k . (5.22) 

One of the generous aspects of quantum operator equations is that their 
eigenvalues may be determined by algebraic rather than differential equa- 
tion methods. The quantum commutator is the key to the algebraic deter- 
mination of the eigenvalue spectrum of mechanical quantities. The commu- 
tator establishes the multiplicative rule of this algebra and operators called 
creation and annihilation operators are its central elements. The creation 
and annihilation operators have the property of generating new quantum 
states from old ones with incrementally more or less quanta. In so doing 
they generate, step-by-step, the complete lattice of states. Here is how they 
unfold the eigenvalues of vectors possessing rotational symmetry for which 
the angular momentum is the prototype. 
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As described in Chap. 4, the quantities 

M + = M 1 +iM 2 , M~ = Mi — iM 2 

are equivalent to M\ and M 2 ; however, their Poisson bracket with M 3 
results in self- rather than cyclic-reproduction. Using Eq. (5.22), it is an 
easy matter to show 



(M 3 M± - M ± M 3 ) = ±fiM ± . (5.23) 

It is this self-reproducing aspect of M + and M_ that gives them the prop- 
erty of creating or annihilating quanta of angular momentum when they 
act upon M 3 . That is why they are the creation and annihilation operators 
of rotational symmetry. Further, although M + and M_ do not represent 
quantities simultaneously observable with M and M 3 , their product does 
represent an observable since it is completely expressible in terms of the 
observables M and M 3 : 



M±M t = Ml + Ml ± hM 3 = M 2 - M| ± hM 3 . (5.24) 

These creation and annihilation operators may now be used to develop 
the eigenvalue spectrum of any vector possessed of the rotational symme- 
try (5.22) using the angular momentum as the illustration. Since angular 
momentum has the same dimensions as h, the eigenvalues of M 3 and M 2 
may be expressed as 

M 3 = rah , M 2 = fdh 2 , 

where m and (3 are pure numbers. The operator identities for the angular 
momentum are then 

.1/;; T j = ITlhSfr M fSrn = [3h 4/ fim- (5.25) 

Notice that is the same eigenstate for both indentities since the eigen- 
states of any two commuting operators are common to both. (The momen- 
tum component operators Mi and M 2 do not possess as an eigenstate 
since they do not commute with M 3 .) The determination of the eigenval- 
ues of the angular momentum reduces to the task of finding the permissible 
values of ?n and (3. 

The eigenvalues m and (3 may be determined by first showing that the 
states between which an angular momentum component such as M 3 may 
change are integral multiples of h. To see this, apply M + to the first of 
Eqs. (5.25): 

M + M 3 ^ 0m = mhM+^tpm. 



(5.26) 
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The operator product M + M 3 may be eliminated in favor of M 3 M + by use 
of the commutation condition (5.23). Equation (5.26) then becomes 

M 3 M + ' = (to + l)hM + 'S>p m . (5.27) 

This equation may be recognized as just the operator equation for M 3 , the 
first of Eqs. (5.25), with m advanced to m + 1, 



— (TO + (5.28) 

where M+'S’ p m is identified with 4 ’ p m +\. The characteristic role of M + as 
a creation operator is therefore revealed: 

M+'Sfpm = const x 4' / 3 m+ i. (5.29) 

The operator M + acting on the state T p m transforms it into the state 
v b/ 3 m+i (aside from a multiplicative constant) thereby demonstrating its 
property of creating a state with one more quantum of angular momentum 
than that upon which it operates. The eigenvalue of the state 4' p m +i 
differs from that of the state 'S’/s m by one quantum of angular momentum 
of magnitude h. 

A symmetrical analysis beginning with the application of the annihila- 
tion operator Af_ to the operator equation for M 3 reveals that Af_ removes 
one quantum of angular momentum from the state 4/ p m upon which it acts 
thereby creating the state 4' i g m _ 1 : 

M_ 4' i3m = const x 4' / 3 to _i. (5.30) 

The angular momentum states of a given component M 3 are therefore 



M 3 = mh 



with the values of ?n differing by integers. 

Now turn to the states of the magnitude M 2 . Its quantum number j3 
may be determined by noting that a component M 3 = mh of a vector of 
magnitude M 2 = /3h 2 can possess only a certain maximum magnitude. Let 
\m\ = j be the maximum value of this component corresponding to a fixed 
magnitude f3. The quantum number to therefore lies in the range 

~j < m < j. (5.31) 

Set to = j in Eq. (5.29), and find M+4'^ = const x 4(gj +1 . But j is 
the maximum value of the permitted states for to, so the state 'S’ pj+i must 
vanish identically thereby requiring 



M+'S’pj = 0 . 



(5.32) 
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The state ^ pj is not an eigenstate of M + , but an operator can be created for 
which 'fyfjj is an eigenstate by applying M_ to Eq. (5.32) thereby creating 
the operator M_M + : 

= 0. (5.33) 

The operator M_M + from Eq. (5.24) involves only M 3 and M 2 , both of 
which possess 4 ' pj as an eigenstate (recall M_M + represents a quantity 
simultaneously observable with M 2 and M 3 ). Equations (5.24) and (5.33) 
therefore yield 



M-M+^pj = \p - j(j + 1)] hHpj = 0, (5.34) 

and the eigenvalue /3 is given in terms of j by 

(3 = j{j + l). (5.35) 

Since p is given in terms of the more fundamental quantum number j, it is 
customary to use j to label the eigenstate as 'kjm- The angular momentum 
eigenvalues are 

M 3 = mh , M 2 = j(j + 1 )h 2 . (5.36) 

The permissible values of j may be determined by noting that for j= 0, 
it is necessary that m = 0 and the angular momentum vanishes identically. 
The first value of j for which two values of in exist differing by an integer 
(corresponding to one quantum of angular momentum) is j = 1/2 for which 
to = 1/2 and in = —1/2. Continuing with increasing j so that each suc- 
ceeding set of in states differ by an integer, one finds the permitted values 
of j arc j = 0, 1/2, 1, 3/2, .... 

It may seem that the determination of the quantum states of the an- 
gular momentum is a rather specialized exercise. But the result we have 
just obtained is a key that will unlock many doors; for the quantum states 
given by Eqs. (5.36) are ubiquitous in the physical world. They are the 
states of any rotationally symmetric vector. We have developed them us- 
ing the angular momentum as the illustration; but they are the quantum 
states of any vector S which satisfies commutation conditions of rotational 
symmetry: 

SiSj - SjSi = iheijkSk ■ (5.37) 

Such a rotationally symmetric vector has a single Casimir invariant — its 
magnitude S — and is characterized by the Casimir set consisting of the 
pair (5, S3). The quantum states of a rotationally symmetric vector are 

S 3 = mh, S 2 = j(j + 1)% 2 . (5.38) 

The number j, which specifies the magnitude, takes integer or half-integer 
values. The number to, which specifies one of the components, takes either 
the integer or half-integer values in the range — j ^ m ^ j. For each j 
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there are 2 j + 1 values of m permitted. For a given vector of magnitude 
S = \f j{j + 1 )h there are 2 j + 1 possible orientations, each one having 
a component S 3 of magnitude rnh where m takes either the integers or 
half-integers lying inclusively between — j and +j. 



Reflection Symmetry 

The gravitational and electrical force laws show that they possess an 
additional symmetry of motion beyond those of translations and rotations. 
These symmetries are space and time reflections in which the space and 
time coordinates are reflected through the origin: 



t' = -t. 

Reflection symmetry is a fundamental symmetry of nature common to 
both the classical and quantum mechanics of gravitation and electricity; but 
it plays no role in the classical motion of particles. On the other hand, it 
is an important symmetry in quantum mechanics. Reflections in quantum 
mechanics are observable and they generate new conservation laws. This 
difference between the classical and the quantum arises because classical 
particles are structureless points; they possess no mechanism for responding 
to reflections. A quantum particle, by contrast, is a superposition of waves. 
These waves are eigenstates 'F(x,f) with a rich mathematical structure 
which responds to space and time reflections 

The potential V of Galilean relativity given by Eq. (3.4) is unchanged 
by reflection of the space coordinates, x — > — x. It is also unchanged by 
reflection of the time coordinate t — > — t because it is independent of time. 
Reflections of the space and time coordinates also do not change the kinetic 
energy because space and time appear quadratically through the momenta. 
Since the Hamiltonian is a sum of the kinetic and potential energies it is also 
an even function of the space and time coordinates. The Hamiltonian of 
gravity and electricity in Galilean relativity possesses reflection symmetry: 
it is both space- and time-reversible. 

Reflections are discrete rather than continuous symmetries. The sym- 
metry parameters that describe them do not take a continuous range of 
values; rather, they are binary variables taking one of two mutually exclu- 
sive values. Reflections may be described by two binary quantities: the 
parity V which describes the response of mechanical quantities to reflec- 
tions of the space coordinates and the time-reversibility T which describes 
their response to reflections of the time coordinate. 

An eigenstate possesses definite values of V and T depending upon 
whether it is an even or odd function of space and time. An even function 
of the space coordinates, T(x,t) = T(— x,f), is said to possess even parity 
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signified by the value V = 1 whereas an odd function, ^(x, t ) = — \E'(— x, t), 
possesses odd parity signified by V = — 1. In like fashion an even function 
of the time coordinate, ^(x,!) = ^(x, — t), is time-reversible with T = 1 
and an odd function of time, ^(x,!) = — ^(x, — t), is anti-time-reversible 
with the value T = — 1. 

The operator V is the parity operator. When acting upon a state T(x, t) 
it inverts the space coordinates: 

■PT(x,t) = T(-x,f). 

The operator T is the time-reversibility operator. It inverts the time coor- 
dinate: 

ftf(x,i) = tf(x,-t). 

Parity and time-reversibility are temporal invariants. Once established 
by initial conditions, they never change. This is because they commute 
with the Hamiltonian, 



VH - HV = 0, TH -HT = 0, 



which is an even function of space and time. Notice, however, that although 
parity is a rotational invariant (it commutes with M ) time-reversibility is 
not and neither of them is a translational invariant (they do not commute 
with P and N). But the combined reflection of both space and time repre- 
sented by the operator VP is both a translational and rotational invariant. 

The eigenvalues of parity and time-reversibility are easily determined. 
First take the parity by examining the quantum identity for the operator 

v 2 -. 

'P 2 \\> = r p 2 \h . 

Now apply the parity operator twice to a state noting that this results in 
no change in the state: 

■p 2 v[/ = vjy 

Comparison of these two expressions shows that the eigenvalues of V are 

V = ± 1 . 

A given state function has a definite value of parity V = 1 or V = — 1. 
This value of parity is invariant: once fixed by initial conditions it cannot 
change. Identical reasoning for time-reversibility shows that the eigenvalues 
of T are also 

T = ±1 



and they are likewise invariants. 

It is easy to see that the Lagrangian L = T — V, like the Hamiltonian, 
is an even function of the space and time coordinates for the potential of 
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gravitation and electricity. This means that trajectories with both even and 
odd parity and time-reversibility satisfy the same law of motion. Nature 
apparently has no preference for one over the other. The time-reversed 
trajectory of any motion in which “all runs backward” is as permissible 
as the forward running trajectory. The law of motion of gravitation and 
electricity is time-reversible. This is not so in the nucleus; the potential 
of the weak nuclear force, unlike that for gravity and electricity, is not 
time-reversible. It is the only known aspect of nature that violates time- 
reversibility at a fundamental level. 

Our experience of the world is one of time-irreversibility: time has a 
unique and vivid direction and the world seems to move inexorably in that 
direction. How can this be? There is no compelling evidence that the time- 
irreversibility of the weak nuclear force in and of itself is the explanation 
for the time-irreversibility of the large-scale world. The time-reversibility 
of gravitation and electricity which govern the large-scale world when the 
world is manifestly not time-reversible is one of the great unsolved problems 
of physics. 



Hydrogen: Prototypical Atom, Primal Element 

We now come to the quantum rendering of the two-body problem: the 
motion of an electron and a positively charged nucleus bound by the electri- 
cal inverse-square force. The classical mechanics of the heavens describes 
the trajectories of two heavenly bodies. The quantum mechanics of the 
atom describes the eigenvalues and eigenstates of two microscopic parti- 
cles. The bound motions of these two particles constitute hydrogen, the 
prototypical atom and the first element of the Periodic Table. 

The quantum motion, like the celestial motion, may be decomposed into 
a center of mass component and a relative component. The center of mass 
translates uniformly through space at a constant velocity. This motion is 
one of a free particle whose mass is equal to the total mass of the electron 
and the nucleus. It possesses the translational invariants of the motion: the 
linear momentum P and the mass-center invariant N. 

It is in the relative motion that the structure of the hydrogen atom is to 
be found just as the motion of the planets was found in the relative celestial 
motion. The invariants of the relative motion are the energy TL , angular 
momentum IVf, and the eccentricity — either in the dimensionless form e 
or in the form of the eccentrum E = he where h = ^k 2 y/{—2TL) is the 
Kepler constant. 

The leap into the quantum world is made by establishing the appropri- 
ate operators for the hydrogen atom invariants TL , M, E. The operators 
TL and IVf are elementary Hermitean operators. But the eccentricity and 
eccentrum do not directly provide a prescription for their operator repre- 
sentations. They are quantities built up from the canonical coordinates 
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r and p for which the direct transcription of the classical expression into 
operator form turns out not to be Hermitean. 

Another way of looking at this difficulty is that the transcription of the 
classical expression for the eccentricity is not unique. One possibility is 



e= pxM/kp — r/r. 


(5.39) 


But an equally possible form is 




e = —M x p/kp — r/r. 


(5.40) 



Both forms are identical in the limit in which p and M become classical 
quantities. However, as operator expressions they can only be identical 
if they are simultaneously observable. The two operators must therefore 
commute. But p and M do not commute since the Poisson bracket of 
any vector with the angular momentum cyclically reproduces the vector 
yielding the commutator 

(p x M) — (-M x p) = i2Tip. (5-41) 

The two potential operators (5.39) and (5.40) are not the same. They differ 
by the factor i2hp. 

The artistic choice of creating an Hermitean operator using both order- 
ings of p and M by taking the average of the two possibilities has proved 
to give results in exact accord with the atomic world: 

e = (pxM — Mx p)/ 2k p — r/r. (5-42) 

Using the commutator (5.41), the eccentricity operator (5.42) may be ex- 
pressed as 

e = (p x M — ihp)/kp — r/r. (5.43) 

The eccentricity operator (5.43) is the image of the classical eccentricity 
(4.7) but with the additional quantum term involving ihp. 

Only four of the six components of M and e are independent since they 
satisfy the symmetry conditions (4.8) which can be expressed as 

e 2 = 1 - (£oh 2 )“W, M • e = 0, (5.44) 

where £q = —k 2 p/2U 2 will be shown to be the ground state or minimum 
energy level of the hydrogen atom. These classical symmetry equations 
become quantum operator equations for e 2 : 

e 2 = 1 — (£qTi 2 )~ 1 'H(M 2 + h 2 ), M • e = 0. (5.45) 

It is a notable fact that M 2 in the first of Eqs. (5.44) appears in the form 
M 2 + h 2 in the first of Eqs. (5.45), its operator counterpart. Readers may 
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wish to verify that when the operator e of Eq. (5.43) is squared, additional 
quantum terms appear which are not present in the square of the classical 
eccentricity e.* The net effect of these quantum terms is to evolve M 2 
in the classical expression (5.44) into M 2 + h 2 in the quantum expression 
(5.45). 

The form of the eccentricity invariant which is most useful in reveal- 
ing the structure of the hydrogen atom is the eccentrum. The symmetry 
equations (5.45) in e may be transformed into operator equations in E. 
Since 7 i commutes with e and E, the squared operators are related as 
SoTi 2 e 2 = E 2 H. Equations (5.45) accordingly become the operator coun- 
terparts of Eqs. (4.11), 

{M 2 + E 2 + h 2 )H = £ 0 h 2 , M-E = 0. (5.46) 

The hydrogen atom is described by the invariant operators IVf, E, and 
TL. The mutual interaction of these invariants gives the atom its structure 
which in turn gives hydrogen its peculiar chemical and physical properties. 
This structure originates in the unique way in which symmetry orchestrates 
the invariants through the commutation or Poisson bracket algebra. These 
symmetries are summarized by the three Casimir sets for two-body motion 
with inverse-square forces: 

0 H,M,M 3 ), (W,M,Mi), (H,E 3 ,M 3 ). 

The magnitude M is the Casimir invariant of three-dimensional rotations 
in configuration space, a symmetry which persists in the presence of the 
wider four-dimensional rotational symmetry in phase space. It therefore 
commutes with all the components of M as well as with their magnitudes 
[see Eqs. (4.10) for the full symmetry algebra]; but it does not commute 
with the components of E because E is not a Casimir invariant of three- 
dimensional rotations. 

The observability of the eccentrum magnitude E 2 (or eccentricity e 2 ) 
turns on the question of the observability of H and M since it depends 
only upon these two quantities according to the first of either Eqs. (5.45) 
or (5.46). It can be seen that E 2 is actually a member of the first Casimir 
set though it does not appear explicitly in the triplet ( Tt , M, M 3 ) because 
the set is identified with only three independent invariants. Its presence 
is indicated by the presence of H and M. The magnitude E 2 (or e 2 ) is 
observable along with (Tt,M,M 3 ). 

In classical two-body motion a state of relative motion with three de- 
grees of freedom is specified by six invariants (/3, 1 ) where (3 and / are each 



* In the square of the quantum eccentricity, e = (px M—ihp)/kp—r/r, noncommuting 
products generate the commutators ih[(p x M)i, Xi/r] and ti 2 [pi , Xi /r] . The first is 
2 jp • r/r and cancels an identically occurring product. The second involves the identity 
\pi)Xi/r] = 2/r and combines with h 2 p 2 /k 2 /i 2 to create (— 2h 2 /k 2 p)(p 2 /2p — k/r) = 
H/So- 
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Casimir sets. In quantum two-body motion the invariants specifying a 
state must all be simultaneously observable. The invariant operators must 
therefore all commute. But the operators corresponding to the six invari- 
ants (f3,I) of the relative motion do not all commute since Iff, If = Sij. It 
is not possible to specify six invariants in quantum motion. We have come 
face to face with the uncertainty principle: half the invariants of quantum 
motion cannot be observed when the other half are specified. 

What then is the quantum situation? The uncertainty principle requires 
that the quantum motion of two particles be cast in a radically different 
way from the classical motion of two heavenly bodies. Here lies the rift 
in the otherwise intimate correspondence between the quantum and the 
classical worlds. A classical state of relative motion is specified by six 
invariants. A quantum state of the hydrogen atom is specified by only three 
of these six invariants. There are three possibilities which correspond to the 
three Casimir sets (ft,M,M 3 ), (fH,M,M{), (ft,E 3 ,M 3 ); however the first 
two are essentially the same motion with different labels. They are both 
vector representations of two-body motion. The uniquely different states 
are specified by either the triplet (Tt, M, M 3 ) or the triplet ( Tt , E 3 , M3), the 
first the vector representation, the second the spinor representation. 

In the vector case (Tt, M, M 3 ) the eccentrum components are not ob- 
servable (though the magnitude of the eccentrum is observable because E 2 
commutes with M and M 3 ). In the spinor case (Tt, E 3 , M 3 ) the eccentrum 
component parallel to an observable angular momentum component is ob- 
servable; but the total magnitudes of the angular momentum and eccentrum 
are not. 

The sets of state specifications ( Ti,M,M 3 ), or (' Tt,E 3 ,M 3 ) are direct 
images of the separation constants of the Hamilton- Jacobi equation in the 
two symmetric coordinate systems (spherical-polar and parabolic) of the 
classical motion. For each of the state specifications one wishes to know 
the actual structure of the hydrogen atom. This information is contained 
in the observable values of the invariants — the eigenvalues of the operators 
(Tt, M, M 3 ) on the one hand and (Tt,E 3 ,M 3 ) on the other. Let us now 
draw out these eigenvalues. 

The States of Hydrogen 

The ground state energy level is the fundamental eigenvalue of the hy- 
drogen atom. This is the minimum energy available to the relative motion 
of the electron about the nucleus. The ground state energy of the motion 
of one heavenly body about another found in Chap. 4 is 

£ 0 = —k 2 g/ 2 M 2 . 

In classical motions this energy is unbounded from below since the angular 
momentum can take arbitrarily small values for which the trajectory be- 
comes progressively a highly elongated ellipse passing closer and closer to 
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the center of mass. The distance of closest approach of the two bodies is, 
from Eqs. (4.48) and (4.49), r m j n = M 2 / 2k/j and is proportional to M 2 . In 
the classical motion of heavenly bodies there is no restriction on the mag- 
nitude of the angular momentum. Hence, as M — > 0, the distance of closest 
approach has the limit r m i n — > 0 and the two bodies are free to collapse 
into one another. 

In quantum two-body motion a new constant fr has appeared bearing the 
dimensions of angular momentum. It establishes an irreducible minimum 
energy level which does not exist in the classical motion of two heavenly 
bodies. This minimum energy barrier prevents the electron from collapsing 
into the nucleus. The ground state energy eigenvalue of the hydrogen atom 
will be shown to be 

£ 0 = -k 2 n/2h 2 . (5.47) 

In the ground state energy of the hydrogen atom, Planck’s constant, the 
fundamental unit of action and angular momentum, replaces the angular 
momentum M which appears in the classical ground state energy. 

The ground state energy of two heavenly bodies depends upon the initial 
conditions of the motion. However the ground state energy of the hydrogen 
atom is completely expressible in terms of fundamental physical constants, 
the electron and proton mass and charge and Planck’s constant. It is there- 
fore a universal constant independent of particular initial conditions. 

For generic quantum motions, one must solve the Sclrroedinger equation 
to determine the eigenvalues and state functions of the motion. This is the 
quantum counterpart to solving the Hamilton-Jacobi equation of the clas- 
sical motion. But in the case of the hydrogen atom, the eigenvalues of its 
operators can be completely determined without solving the Sclrroedinger 
equation. The reason for this exceptional occurrence is the symmetry of 
two-body motion with inverse-square force. The four-dimensional rota- 
tional symmetry of the hydrogen atom is the Rosetta stone for deciphering 
its eigenvalues. The eigenvalues of the hydrogen atom were first deduced 
from the invariants of four-dimensional rotations by Wolfgang Pauli in the 
mid-1920s before the Sclrroedinger euqation had become established.* 

The ground state energy eigenvalue £q and the full set of eigenvalues 
of the invariants flow from the algebra of their commutation relationships. 
The eigenvalues of the energy 7i, the angular momentum M 2 , the eccen- 
trum component E 3 , and the angular momentum component M3 may be 
represented in terms of quantum numbers n, l, q , and m: 

£ = £ 0 / n 2 , M 2 = l(l + l)h 2 , E 3 = qh , M 3 = mh. (5.48) 

The determination of the eigenvalues of the invariants then reduces to the 
determination of the quantum numbers (n,q,m) and ( n,l,m ). We already 

* W. Pauli, “Uber das Wasserstoffspektrum vom Standpunkt der neuen Quanten- 
mechanik,” Z. Phys. 36, 336-363 (1926); English translation in Sources of Quantum 
Mechanics, B. L. Van der Waerden (ed.), New York: Dover Books, 1968, 387-415. 
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know the quantum numbers m and l for the rotationally symmetric vector 
Ad. They are given by Eqs. (5.36). But we do not yet know the quantum 
numbers n and q nor do we know the relationships they bear to m and l. 

The four-dimensional rotational symmetry of the hydrogen atom may be 
made transparent in quantum motion just as it was in classical motion by 
utilizing the rotational operators S and D of the two families of two-splreres 
which are the projections of the three-sphere of four-dimensional rotational 
symmetry: 

S = (M + E)/ 2, D = (Ad-E)/ 2, (5.49) 

in terms of which Ad and E are 

M=S + D, E = S D. (5.50) 

The description in terms of S and D is equivalent to that in terms of M 
and E. A state of the hydrogen atom specified by {hi, E 3 , M 3 ) is the same 
as that specified by (H, S 3 , D 3 ). 

The symmetry equations (5.46) may also be transformed into operator 
equations for S and D and follow those for the classical motion (4.13) with 
the addition of a uniquely quantum term which appeared in squaring the 
eccentricity operator: 

(S ' 2 + D 2 + h 2 /2)H = £ 0 h 2 / 2, S 2 — D 2 = 0. (5.51) 

As in the classical motion, the magnitudes S 2 and D 2 are identical and one 
of them, say D 2 , may be eliminated. Since all the operators in Eqs. (5.51) 
commute, they may be represented in terms of their eigenvalues, £ = £o/n 2 
and S 2 = D 2 = s(s + 1 )% 2 . The first symmetry condition of Eqs. (5.51) 
them becomes solely an equation in n and s: 

2 s(s + 1 ) + 1/2 = n 2 / 2 , 



or, 

n = 1 + 2s. (5.52) 

We see that the quantum number s of the rotationally symmetric spher- 
ical projections from the three-sphere is the master quantum number of the 
hydrogen atom orchestrating all others. The energy quantum number n is 
given in terms of s by Eq. (5.52). 

The angular momentum and eccentrum quantum numbers also flow from 
the quantum number s of the symmetry spheres. The S 3 and D 3 eigenval- 
ues, as components of rotationally symmetric vectors given by Eq. (5.38), 
range over values set by the master quantum number as 



S 3 = crh, 



D 3 = Sh-, 



—s^a,5^s. 



(5.53) 
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The angular momentum and eccentrum quantum numbers then follow from 
Eqs. (5.50) as 

m = a + 6, q = a — 5, (5.54) 

where a and S take either the integer or half-integer values lying between 
— s and s. 

The hydrogen states therefore flow from the quantum numbers of the 
symmetry spheres as follows. One passes through all values of the master 
quantum number s taken from the sequence s = 0, 5, 1, . . . . The corre- 

sponding values of a and 5 then range over the set — s ^ a, S ^ s for each 
value of s. In this manner one builds up all the quantum numbers n, q , and 
m according to Eqs. (5.52) and (5.54). In particular, one finds immediately 
from Eq. (5.52) that the quantum number n takes all the positive integers: 

n= 1,2,... . 

The energy level n = 1 corresponds to s = 0 and a = 5 = 0. Only one 
state exists, the ground state, for which q and m take the values 

<7 = 0 , m = 0 . 

The quantum number n = 2 corresponds to s = | from Eq. (5.52) and for 
which <7 and S take half-integer values from the set (— ^, |). Four states 
of q and m correspond to the four possible combinations of these values 
allowed by Eqs. (5.54): 

<7 = 0, m = ±1; q = ±1, m = 0. 

The quantum number n = 3 corresponds to s = 1 for which a and 6 
take integer values from the set (—1, 0, 1). These values may be combined 
according to Eqs. (5.54) to generate nine different states of q and m: 

q = 0, m = 0; 

<7=1 , to = — 1; q = —1, m = 1; 

(7 = 0 , to = ± 2; q = ±1 , to = ± 1; q = ±2 , to = 0. 

The quantum number n = 4 corresponds to s = | for which a and S 
take half-integer values from the set (—§,— 5, \ , §)• Sixteen states of q 
and to arise from these values: 



<7 = 


0 , 


m — 


±i; 


q = 


± 1 , 


m = 0 


q = 


± 1 , 


m = 


T2; 


q = 


± 2 , 


TO = =pl 


q = 


0 , 


m = 


±3; 


q = 


± 3 , 


m = 0 


q = 


± 1 , 


m = 


±2; 


q = 


± 2 , 


in = ±1 



Each energy level n possesses n 2 distinct states corresponding to the n 2 de- 
generacies brought about by the concerted action of the central and inverse- 
square natures of the electrical force. 
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Although the quantum numbers s, a, and 5 take both integer and half- 
integer values, the half-integer values generate quantum numbers n, q, and 
in which are always integer. Half-integer states of angular momentum and 
eccentrum do not appear in the hydrogen atom. 

The eigenvalues of the total energy £ = £ 0 /n 2 can now be used to show 
the intimate connection between the Kepler constant h and the Planck 
constant h. Use £ = £ 0 /n 2 = —k 2 /.i/2h 2 and eliminate the quantum ground 
state energy £ 0 = —k 2 /j,/2h 2 to find 

h = nh. (5.55) 

The Kepler constant is quantized in units of h. For low values of the 
energy quantum number n the discrete character of the Kepler constant is 
inescapable. Only in the limit n 1 does it acquire the semblance of a 
continuous mechanical quantity. 

The eigenvalues of the eccentricity may now also be determined. The 
eccentricity e is related to the eccentrum E by e = E/h. Its eigenvalues 
therefore follow from those of £3 = qh and h = nh: 



e 3 = q/n. 



(5.56) 



The eigenvalues of the hydrogen atom in the spherical-polar state given 
by the vector specification (TL, M, M 3 ) are now also known. The eigen- 
values of TL and M 3 are the same as those in the spinor state specification 
(TL, £ 3 , M 3 ). The angular momentum magnitude M replaces the eccentrum 
£3 in this specification. The eigenvalues of the angular momentum magni- 
tude M have already been exhibited in Eqs. (5.36). However only the inte- 
ger states of angular momentum appear in the hydrogen atom according to 
Eq. (5.52). The eigenvalues of the angular momentum are M 2 = 1(1 + 1 )fi 2 
and M 3 = mh, as given in Eqs. (5.48) where the l are the positive integers 
including zero and m has values The eigenvalues of the energy 

are TL = £q/u 2 ', however the angular momentum eigenvalues permitted de- 
pend upon the energy eigenvalues. This dependence may be found from 
the eccentricity magnitude e. 

The squared eccentricity operator is given by the first of Eqs. (5.45). 
The eigenvalue equation e 2 'P = e 2 T is therefore 



l-(£ 0 ft 2 ) _ 1 W(M 2 + ft 2 ) 



tk nlm 



— e tk n l m 



(5.57) 



The eigenstates 4 > n i m of e 2 are also eigenstates of the operators TL and 
M 2 . The eigenvalues of e 2 can therefore be directly read off Eq. (5.57) by 
replacing TL and M 2 with their eigenvalues:* 



1 + 1(1 + 1 ) 

n 2 



(5.58) 



* Note that if the energy and momentum eigenvalues are simply substituted into the 
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Since e 2 ^ 0, Eq. (5.58) shows that the angular momentum quantum 
number l and the total energy quantum number n must bear the relation- 
ship 

l ^ n — 1. 



For a given quantum number n, the quantum number l takes all the values 
0 ^ l ^ n — 1 and m takes all the values \m\ ^ l. For n = 1 the single state 
of l and to is 

1 = 0 , m = 0. 

For n = 2 there are four states of l and to: 

1 = 0 , to = 0 ; 

l = 1, to = 0; 1 = 1, m = ±1. 

For n = 3 the nine states of l and m are 

1 = 0 , to = 0 ; 

l = 1 , to = 0 ; l = 1 , to = ± 1 ; 

l = 2, to = 0; 1 = 2, m = ±1; 1 = 2, m = ±2. 

For n = 4 there are sixteen states of l and to: 

1 = 0 , to = 0 ; 

l = 1 , to = 0 ; l = 1 , to = ± 1 ; 
l = 2, to = 0 ; l = 2, m = ± 1 ; 1 = 2, m = ± 2 ; 

l = 3, m = 0; 1 = 3, m = ±1; 1 = 3, m = ±2; 1 = 3, m = ±3. 

As was the case for the parabolic states, the spherical-polar states of the 
hydrogen atom advance with energy level n in the sequence 1—4—9—16—- • -, 
in accord with n 2 degeneracies. Both state specifications of the hydrogen 
atom possess this fundamental property. 

The semi-major axis of the ellipse of classical motion is given by the 
first of Eqs. (4.54) as a = k/2\£\ and is a function only of the energy 
£. The semi-minor axis is given in terms of the semi-major axis and the 
eccentricity as b = a\J 1 — e 2 . A correspondence with classical mechanical 



classical expression for the squared eccentricity, the first of Eqs. (4.8), one obtains 

2 , 1 ( 1 + 1 ) 

e = 1 j — ’ 

n z 

an expression which differs from Eq. (5.58), the correct quantum result, by the term 
1/n 2 . Only in the limit of large quantum number n do the two expressions become 
identical. 
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quantities may be obtained by substituting the eigenvalues of Tt and those 
for e 2 given by Eq. (5.58) into these expressions: 

a = n 2 ao, b = \J\ + 1(1 + l)nao, (5.59) 

where ao = k/2\£o\ = ti 2 /ky = 0.529 x 10 _8 cm is the Bohr radius. This 
atomic radius was first proposed by Niels Bohr in 1913 in his simple quan- 
tum model of the atom in which the electron moves in a circular orbit about 
the nucleus. 

In the ground state n = 1, l = 0, m = 0, the mechanical quantities a, 
b , and e have the values a = ao, b = ao, e = 0. To the extent that it 
can be described in the language of classical trajectories, the ground state 
orbit has circular symmetry, consistent with Bohr’s original insight. Unlike 
the motion of a heavenly body (and unlike Bohr’s model), the electron 
possesses no observable orbital angular momentum in the ground state. 
The ground state electron is not a point particle moving on a circular 
trajectory. The electron is not confined to an orbit but is a wave distributed 
over the entire space surrounding the nucleus. In the ground state this 
distribution is completely symmetrical in angle. The most likely radial 
position of the electron (described by the probability density distribution 
I’l'nool 2 ) corresponds to r = a = n 2 ao; but there is always a non-negligible 
probability that an electron in a given state may be found anywhere in the 
space surrounding the nucleus. 

In the higher energy levels n > 1, no “circular” states exist in which the 
eccentricity vanishes except in the limit n — > oo. The minimum eccentricity 
corresponds to the maximum value of the angular momentum quantum 
number Z max = n — 1 and is given by 

2 = ( n ~ X ) 

^min o 

Tl z 

These are states for which e% = q/n = 0 and the eccentricity vector lies 
completely in the plane perpendicular to the polar axis. 



The State Functions of Hydrogen 

The invariant operators of the hydrogen atom generate the eigenvalues 
which are observable and measurable. But quantum operators beget not 
only eigenvalues; they also beget eigenstates which define the probability 
density distributions of the electron about the nucleus. These probability 
density distributions are the shapes of the electron waves corresponding to 
the different states. 

The shapes of the hydrogen atom electron waves are inherently three- 
dimensional. The classical relative motion of two heavenly bodies is es- 
sentially two-dimensional (the motion lying in a plane perpendicular to the 
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angular momentum vector), but the quantum relative motion of two bodies 
is always three-dimensional. There is always a finite probability that the 
motion may lie outside the plane perpendicular to any component of the 
angular momentum which one may specify, such as M 3 . This is because 
one cannot precisely specify the angular momentum; hence a plane perpen- 
dicular to the angular momentum vector in which the relative motion of 
the atom must take place cannot be precisely defined. 

The symmetries of Kepler motion are described by the vector Casimir 
set (H, M, M 3 ) and the spinor set (Tt, E 3 , M 3 ). If the atom is a vector state 
(' H , M, M 3 ), the coordinates that reflect this symmetry are spherical-polar 
coordinates r, <f>, 9 shown in Fig. 4-9. The coordinate lines in this system 
are the intersections of the surfaces of spheres r = const, cones 9 = const, 
and planes </> = const. 

If a state of the atom is specified by the spinor Casimir set (H, E 3 , M 3 ), 
the corresponding symmetric coordinates are the parabolic coordinates £, 
? 7 , <f> shown in Fig. 4-10. These coordinates are formed by the intersections 
of surfaces of paraboloids of revolution £ = const, rj = const about the x 3 
axis whose foci are the origin and by the planes <f> = const where <f> is the 
polar angle as in spherical-polar coordinates. 

The eigenstates of the hydrogen atom are the wave functions \E ’ n i m for the 
state specified by (' H,M,M 3 ) and the quantum numbers ( n,l,m ) or 'Ynqm 
for the state specified by ( Tt,E 3 ,M 3 ) and the quantum numbers ( n,q,m ). 
These wave functions generate the electron probability density distribution 
functions | 'F,u m | 2 and |'F„ gm | 2 . 

Separability of the Hamilton- Jacobi equation on the classical level is 
transformed into separability of operator equations on the quantum level. 
On the quantum level the wave function for the spherical-polar states is 
decomposable as the product 

'i ’nim{r,(j),9) = R n i(r)@i m (0)$ m ((j>), 
with a similar decomposition for the parabolic states 

'k nqm ~ nqm{j])^ mi.4*) * 

The three functions R n i(r), O i m {9), and $ m {4 > ) on the one hand, or X(£), 
Y(rf), and $> m ((f>) on the other, are determined from the three operator 
identities for (H,M,M 3 ) or (' H,E 3 ,M 3 ). A correspondence exists between 
the three separated Hamilton- Jacobi equations for the three partial actions 
on the classical level and the three operator equations for the actions on 
the quantum level. The Casimir sets contain the separation constants. The 
quantum numbers ( n,l,m ) or ( n,q,m ) appear as the separation constants 
on the quantum level. Both sets are ordinary differential equations which 
are integrable. The details of integration of the quantum operator equations 
may be found in Note 9 for the spherical-polar states and in Note 10 for 
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the parabolic states. The eigenstates must be normalized by the condition 

/ \Ir* vT> , d 3 X = 1 

The polar eigenstates $ m (q !>) are common to both the spherical-polar 
and parabolic eigenstates and are quite simple. The operator equation 



n im = mM/nl m 



reduces to 

d 

= rnh^ m {4>) 

with the solution 

= e im( l >+const . 

The normalization condition for the wave function determines the constant 
of integration resulting in 







It is interesting to observe that the hydrogen atom probability density 
for both the spherical-polar and parabolic states is independent of the polar 
angle 4 i. Since <3> m (</>)<!> = 1, the probability density is given by 



I *nlm\ 2 = Rnl(r)R* nl (r)Qlm6U0) 



for the spherical-polar states and 



I'f'nqml — ^nqmiO^nqrniV^nqmiv) 



for the parabolic states. On the other hand, the probability current J 
possesses only a polar component, 



J <l> 



2 \ dcj) dcf) ) 



mh 

2 fir 



|T| 2 , 



and is also independent of polar angle (though it does depend upon r). 
There is therefore no probability current across spherical surfaces r = const, 
nor is there a current across the conical surfaces 9 = const. Probability 
current only swirls across the planes 4> = const of Figs. 4-9 and 4-10 and this 
current is itself uniform in polar angle. The divergence of the probability 
current therefore vanishes and the probability continuity equation (5.14) 
therefore yields a time-independent probability density |’P| 2 consistent with 
the steady-state of the atom. 
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The components of the state function R(r) and 0(0) are not elemen- 
tary functions. They are particular kinds of polynomials in r which are 
forms of the confluent lrypergeometric function called Laguerre polynomi- 
als and another family of polynomials in 0 called Legendre polynomials. 
The spherical- polar eigenstates are determined in detail in Note 9 and a 
large collection of the lowest quantum number states are explicitly exhibited 
there. 

The probability density of the hydrogen atom for both the spherical- 
polar and parabolic states is always azimuthally symmetric: there is no 
distinguished azimuthal angular position (f>. This is not so for the latitudinal 
angle 9. The probability density does depend upon this angle because 
different values of the quantum numbers l and m reflect preferred locations 
in the latitude of the electron which yield the angular momenta specified 
by these values. On the other hand, there is no way to distinguish a “top” 
and “bottom” of the atom. The probability density is mirror-symmetric 
across the plane X 3 = 0, reflected in the condition | v f'(0)| 2 = |'I f (7r/2 — 0)| 2 
which one may verify for all states. 

The ground state, 



'I'ioo — 



— _L r ,~ 3 / 2 P - r / a o 



depends only upon the radial coordinate r and is spherically symmetric. 
All eigenstates fall off exponentially from the origin over a length scale 
equal to the Bohr radius uq. The higher quantum number states fall off 
more slowly with the factor e~ r / na °. All the eigenstates Tnoo in which the 
angular quantum numbers l and m vanish are spherically symmetric. For 
example, the state Taoo shown in Fig. 5-3 is 



'J / 300 — 



-L -3/2 

8lV^ a ° 




e ~r/3a 0 



In addition to falling off exponentially in r the higher states also exhibit 
wave structure in r. The number of nodes (points at which *1 ' n i m vanishes) 
increases with increasing quantum number n. The number of radial nodes 
for an eigenstate \E ' n im is equal to n — 1. While the ground state has 
no nodes — only a solitary wave which falls off in r — the higher states are 
radial waves that oscillate in space with increasing numbers of nodes as 
n increases. The wave functions become more intricate as n increases; 
but the intricate parts of these wave functions are of progressively smaller 
amplitude relative to the eigenstates of lower quantum number. 

The effects on the state function induced by the total angular momentum 
quantum number l are symmetric about the polar axis. The effects upon 
nim wrought by increasing angular momentum quantum number m are 
oriented about the polar axis and are mirror-symmetric across the plane 
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Figure 5-3. spherical-polar Eigenstates of the Hydrogen Atom. The state is represented 
as surfaces upon which probability densities | 'I' 300 1 2 are uniform. In addition to falling 
off exponentially in r, these spherically symmetric states also exhibit wave structure 
indicated by the multiple surfaces. The probability density |\t| 2 vanishes on two nodal 
surfaces which are located between the inner two and outer two surfaces. ( Courtesy 
George D. Purvis III) 

X 3 = 0 (the plane in which the corresponding classical two-body motion 
would lie). The angular momentum effects can best be seen for the case of 
maximum angular momentum for a given energy level, that is, states for 
which l = n — 1. Each of these states is distinguished by the polar angular 
momentum quantum number m ^ l. A sequence of such states is exhibited 
in Figs. 5-4 for the case l = 2 corresponding to the energy level n = 3. 
When \m\ = l, the angular momentum vector is aligned with the polar axis 
and the electron is most likely to be located near a plane perpendicular to 
the angular momentum vector. 

This condition corresponds most closely to the classical motion and the 
effect can be seen in the state T 3 2 ± 2 , 

*32±2 = 3/2 (-) e _r/3a ° sin 2 9e ±2 ^, 

162 sjp: \ a Q ) 

which is illustrated in Fig. 5-4 (a). The electron probability distribution is 
concentrated in the plane perpendicular to the polar or 3-axis. 

In Fig. 5-4 (b), the same states are shown, but with the polar component 
diminished to m = 1: 

^ 32± i = — V< 3/2 ( e_r/3a ° sin 9 cos Oe^*. 

81 a / 7 t \ao J 

The electron now is distributed over two torus-like surfaces and its net 
contribution to the polar angular momentum is correspondingly diminished. 
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Figure 5~4 (a). spherical-polar Eigenstates of the Hydrogen Atom. A probability density 
surface for the state 'T'32 +2 illustrates the case m = l. In this case the angular momentum 
vector is aligned with the polar axis (perpendicular to plane of the torus-like surface) 
and the electron is most likely to be located near this plane. This state corresponds most 
closely to the classical motion. (Courtesy George D. Purvis III) 

When the polar angular momentum is reduced to m = 0, the states are 
most unlike the classical motion. The electron must execute its motion 
in such way that it does not generate an observable polar component of 
angular momentum. An example of such a state is 

4 '“> = G0 

aud shown in Fig. 5-4 (c). One sees that the electron wave function pos- 
sesses upper and lower lobes which cancel the angular momentum on the 
central torus-like zone around the polar axis so that it possesses significant 
angular momentum components M\ and M 2 but vanishing M 3 . The inter- 
ested reader may examine the variety of structure in the full set of states 
up through n = 3 found in Note 9. 
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Figure 5~4 (b). spherical-polar Eigenstates of the Hydrogen Atom. A probability density 
surface for the state \I/ 32 ±i is shown. The electron is distributed over two torus-like 
surfaces. The angular momentum is maximal but no longer totally polar. ( Courtesy 
George D. Purvis III) 



The parity of the spherical- polar states of hydrogen turns on the quan- 
tum number l (see Note 9 for the details). The parity of a given state is 
given by 

P =(-!)'• 



The spherical- polar states illustrated thus far have all been even-parity 
states corresponding to even values of l. An example of an odd-parity state 
is 



T310 = 



- 3/2 



27 






e -r/3a 0 cos 



shown in Fig. 5-5. 

Consider now the set of eigenstates for the hydrogen atom in a spinor 
state specified by (TL, £ 3 , M 3 ) with corresponding parabolic quantum num- 
bers (n,q,m). The detailed determination of these eigenstates may be 
found in Note 10. The angle coordinate <j> and its corresponding rotational 
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Figure 5~4 (c). spherical-polar Eigenstates of the Hydrogen Atom. A probability den- 
sity surface for the state ^320 illustrates a state of maximal total angular momentum 
quantum number l = n — 1 but a vanishing polar component m = 0. The electron is 
distributed over upper and lower lobes and the central torus-like zone in such a way that 
the polar angular momentum vanishes. This state differs most from the classical motion. 
(Courtesy George D. Purvis III) 

invariant M 3 are common to both the spherical-polar and parabolic repre- 
sentations. The polar states <I> m (<^) = e 1 ™^ / \f2n in parabolic coordinates 
are precisely the same as the polar states in spherical-polar coordinates. 
The remaining parabolic states X nqm (t ;) and Y nqm (ij) are not elementary 
functions. These two functions are, however, formally identical, differing 
only in the sign of the quantum number q as shown in Note 10. The 
parabolic functions have the property 

X n qm — h n—qm ( V ). (5.60) 

The parabolic state functions are confluent hypergeometric functions simi- 
lar to the Laguerre polynomial. They are determined in Note 10. 

The parabolic ground state is 

*ioo = ^= a C /2e ~ (i+ri)/2ao - 
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Figure 5-5. spherical-polar Eigenstates of the Hydrogen Atom. A probability density 
surface for an odd parity state: the state 'J 310 . (Courtesy George D. Purvis III) 

This state is identical to the spherical-polar ground state. It must indeed 
be so since for n = 1 it is required that q = m = l = 0. The exponential 
fall-off terms all appear with the common factor (£ + ?;) / 2 nao ■ In parabolic 
coordinates the factor (£ + 77) /2 is the magnitude of the position vector r. 
The exponential fall-off terms are therefore of the form 

e -(£+g)/2na 0 _ e ~r/na 0 

and are identical to the exponential fall-off factors of the state functions 
4 ’nim in spherical-polar coordinates. Since these factors depend only upon 
the energy level quantum number n and this quantum number is common 
to both state specifications, the two factors are identical. Moreover, since 
the ground state is completely symmetrical in both state specifications, 
the ground state functions 4' 10 o are identical for both spherical-polar and 
parabolic coordinates. 

The spherical-polar and parabolic states actually overlap in an even 
more general way: they are identical whenever q = 0 for the parabolic 
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states and l = n — 1 for the spherical-polar states. Using the relationships 
£ = r(l + cos0) and rj = r( 1 — cos 9) it is an easy matter to show that 
the parabolic state ^ n om is identical to the spherical-polar state ’•I-hm-i-m- 
The states for which q = 0 and l = n — 1 are also the states most closely 
related to the classical motion; for in this case the eccentricity vector lies 
completely in the plane of motion perpendicular to the M3 component of 
the angular momentum. 

In contrast to the spherical-polar states, the parabolic states are not 
generally symmetric with respect to the plane X3 = 0. They are only so for 
states 'i'nqm with q = 0 as illustrated by 



^300 — 



-3/2 



9-v/7T 



1 - 



1 - 



0 -(£+>?)/6ao 



which is shown in Fig. 5-6 (a). 




Figure 5-6 (a). Parabolic Eigenstates of the Hydrogen Atom. A probability density 
surface for the state \1 / n qm = ^300 illustrates a parabolic state symmetric with respect 
to the plane x% = 0. The x% axis is perpendicular to the toroidal plane and the atom 
has been rotated toward the reader for better viewing. ( Courtesy George D. Purvis III) 
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Parabolic states with non-vanishing eccentricity q are asymmetric and 
are illustrated by the state 



'J / 3-20 







1 p-(C+»))/6ao 

18 \a 0 ) \ 



This state is shown in Fig. 5-6 (b). The full set of parabolic states up to 
n = 3 may be found in Note 10. 




Figure 5-6 (b). Parabolic Eigenstates of the Hydrogen Atom. A probability density sur- 
face for an asymmetric parabolic state: the state ^nqm = ^ 3 - 20 - The surfaces encircle 
the X 3 axis which is perpendicular to the planes of the flattened spheroid structures. 
(Courtesy George D. Purvis III) 

Parity manifests itself in parabolic states as an interchange symmetry 
as described in the discussion following Eq. (4.45). Inversion of the space 
coordinates interchanges £ and ?y. Since the eccentricity is a function of 
odd parity, the sign of its quantum number q is also changed on coordinate 
inversion. 
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The hydrogen atom — the two-body motion with inverse-square force ren- 
dered by quantum mechanics- possesses eigenstates which form a rich col- 
lection of three-dimensional structures. Quantum states stand in marked 
contrast to classical trajectories for which the orbit lies wholly in the plane 
perpendicular to the angular momentum vector. The eigenstates are in- 
tricate geometric shapes formed of exponential, trigonometric, and poly- 
nomial functions. They contrast dramatically against the classical motion 
for which the geometric structures are the circle and ellipse. This collec- 
tion of eigenstates of quantum two-body motion provides the basis for the 
manifold structure of the elements — structure not contained in the ellipse 
of classical motions. 

Mechanics of the Periodic Table 

The total number of eigenstates that correspond to a given value of n 
(and hence to a given energy level £ n ) increases in the pattern 1+4+9+- • • . 
This pattern closely corresponds to the pattern 2+8+ 18+ • • • of the Periodic 
Table. But not quite. Aside from the fact that the quantum pattern is 
off by a factor of two from the Periodic Table, there is no requirement 
that a nucleus with a given number of electrons would have its electrons 
occupying all these states. In fact, the likely condition would be one of 
the lowest total energy. All the electrons would seek to occupy the ground 
state n = 1,1 = m = q — 0. 

The rule which governs the permissible states that electrons may oc- 
cupy thereby removing the arbitrariness described above (and hence pro- 
vides the basis for the atomic description of the Periodic Table) lies beyond 
the inverse-square electrical force of the two-body problem. Although the 
inverse-square force law (3.4) is adequate for the basic description of the 
interaction between an electron and a positively charged nucleus, it is quite 
incomplete for a description of the quantum properties of the electron itself. 

This is the first glimmer of approaching crisis; for to inquire into the 
quantum properties of the electron is to plunge into the breakdown of the 
Galilean symmetries of the two-body problem. We need not yet face this 
crisis because the implications of the electron’s quantum properties for the 
hydrogen atom turn out to be quite simple. We can summarize them as 
follows. The eigenstates of the electron must be regarded not as a scalar 
but as a four-dimensional quantity, 



T = (tfo.tfi, ^ 2 , tf 3 ), 



consistent with the relativistic description of space and time as a four- 
dimensional space incorporating three space components and one time com- 
ponent. 

What meaning does one attach to the various components of a four- 
valued state ($o, ^1,^2, ^ 3 ) for a charged particle? Amazingly, these four 
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components represent two distinct charged particles: the electron and its 
antiparticle, the positron. In addition each particle has two new states 
peculiar to the quantum realm. Because of the intimate connection of 
these states with the angular momentum of the particle, these states are 
called spin states. Each spin state possesses only two eigenvalues: +%/ 2 
and -71/2, the fundamental eigenvalues of angular momentum. The four- 
valued state represents an electron with two spin states and a positron with 
two spin states, for a total of four states. 

The most significant result of the quantum theory of the electron for 
the structure of the atom is the Paidi Exclusion Principle. It requires that 
no two electrons can be in the same state (where the spin is one of the 
quantities defining the state in addition to the three quantum numbers of 
the Casimir set). 

The exclusion principle brings completion to the mechanical theory of 
the elements. Even though the ground state is the preferred state for a 
nucleus with an assembly of orbiting electrons, it can be occupied by two of 
them at most; and these two electrons must have opposite spins. Since two 
electrons (one of spin +U/2 and one of spin —%/ 2) can at most occupy each 
of the eigenstates 'i'nim for each unique combination of n, l , m, the pattern 

of eigenstates 1 + 4 + 9 H actually corresponds to electronic occupations 

of 2 + 8 + 18 + • • • which is the basic pattern of the Periodic Table. 

The emergence of the Periodic Table from the mechanics of the atom is 
one of the magnificent achievements of physics. It is the quantum coun- 
terpart to the equally magnificent emergence of the ellipse of Kepler from 
Newtonian mechanics of the solar system. They are the motions — the one 
classical, the other quantum — of two bodies bound together by the inverse- 
square force. 

The actual energy levels of a multi-electron atom will not quite follow 
the simple 2 + 8 + 18 + • • • pattern of the two-body atom because of the 
mutual interaction among electrons in addition to their interaction with 
the nucleus. Just as the elliptical orbits of the planets about the sun are 
perturbed by one another’s presence, so too, the pure quantum mechanical 
states of the electrons in the hydrogen atom are perturbed by one another’s 
presence. 

The atoms beyond hydrogen are many-body atoms. The approximate 
energy levels of multi-electron atoms, still using the ordering scheme of 
hydrogen, are shown in Fig. 5-7. It can be seen that there are two blocks 
of 8 states before the first appearance of the 18 states and then two blocks 
of 18 states consistent with the Periodic Table. Although the n = 3, l = 2 
state (4 , 32 m) h as a lower energy level than the n = 4, l = 0 state (4 , 4oo) 
when electron interactions are neglected, the situation is reversed in an 
actual multi-electron atom. 

This is because the inner electrons shield the positively charged nucleus 
making it appear to have a diminished charge to the outer electrons. Outer 
electron energy levels are therefore shifted upward from those which they 




5. Quantum Mechanics: The Elements 



185 



Energy 



% 4'500 


^P^P^P ^ 51m 


OOOOO 4/ 42m 


(9) 18 


O ^400 


OOO ' I ' 41m 


OOOOO 4^ 32m 


(9) 18 


O ^300 


OOOaim 




(4) 8 


l = 2 


O 4 / 200 


OOO ^21 m 




(4) 8 


l = 1 


O ^100 






(1) 2 



l = 0 

o o o o • 

n = 1 n = 2 n = 3 n = 4 n = 5 



Figure 5-7. Energy States and the Periodic Table. The bands correspond to states with 
approximately the same energy level. States with different principal quantum numbers 
n are identified by different shades. The states are arranged according to the angular 
quantum number l. The Periodic Table groups states according to energy level; hence 
the numbers of states of the atom follow the number of states within each energy band 
which are 1 + 4 + 4 + 9 + 9 + -- - and which correspond to electronic occupations of 
2 + 8 + 8 + 18 + 18 + -- -. 



would have if they interacted solely with the nucleus. In the n = 2 state, 
the l = 1 states are elevated above the ground state l = 0 because of the 
shielding effect of the two electrons in the ground state. The same effect 
shifts the l = 1 states for the n = 3 levels. At the n = 3 level the l = 2 
states are further shifted because of the shielding by both the two l = 0 
ground state electrons and the six l = 1 electrons. The cumulative effect 
of these shifts is to place the n = 3, l = 2 states slightly above the n = 4, 
l = 0 states so that they group themselves with the n = 4 , l = 0,1 states 
rather than the states of their own principal quantum number n = 3. This 
shifting effect creates two rows of 8 states before the onset of rows with 18 
states. These rows of 18 states are also double because of the same effect, 
as shown in Fig. 5-7. 
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The ground state energy of the primal element, \£q\ = kgi 2 /2h 2 , is a 
fundamental atomic constant based upon the electron mass and the hy- 
drogen nuclear mass. It has a value of 13.6 eV (the energy acquired by 
an electron if it is accelerated through a voltage difference of 13.6 V). Di- 
vision of this energy by the mass of the hydrogen atom yields an energy 
scale of approximately 50,000 British thermal units (BTU) per pound or 
100 million Joules (megajoules) per kilogram. This fundamental quantum 
mechanical energy level sets the energy scale of all chemical reactions. The 
energy levels of these eigenstates directly determine the chemical reactivity 
of the elements. It is the differences in energy levels that provide the basis 
for chemical changes in which electrons and nuclei rearrange themselves so 
as to achieve stable states of minimum energy. 

The energy that may be released by a pound of dry wood if it is burned 
is about 10,000 BTU. In burning, the electrons orbiting the carbon and 
hydrogen nuclei in the wood interact with the electrons and nuclei of the 
oxygen in the air to form carbon dioxide and water molecules which end 
up with an excess of energy. The excess energy arises because the internal 
energy of the water and carbon dioxide molecules is less than the internal 
energy of the hydrogen and oxygen atoms. This excess energy is converted 
by collisions of the carbon dioxide and water molecules and by radiation 
to heat. The energy released when a beam of stone, wood, or steel is 
fractured is thousands of BTU per pound of the fractured portion. The 
energy contained in a pound of food and transformed into motion and heat 
by our bodies is several thousand BTU per pound. The chemical energy 
level of fat is a few thousand BTU per pound. The wild goose transforms 
fat into propulsive motion with its wings, flying about a thousand miles on 
several pounds of fat. All these chemical transformations are rooted in the 
quantum-mechanical ground state energy |£ 0 |=13-6 eV. 

It is the energy level differences of atomic states (rather than quantum 
number differences) that give each element its peculiar chemical properties. 
Hence, the clustered groups of states in Fig. 5-7 which correspond to the 
same energy level, rather than levels marked by the principle quantum 
number n of the two-body hydrogen atom, demark the chemical properties 
of the many-body elements. The full quantum-mechanical problem of multi- 
electron atoms is a many-body problem just like that of the solar system. 
It cannot be solved exactly like the single electron hydrogen atom. 

Nonetheless, the hydrogen atom is the prototypical atom, the exactly 
soluble motion of quantum mechanics. Although the hydrogen atom is 
the only element for which an exact quantum- mechanical solution exists, 
mechanics nonetheless describes the motion of all the elements in their 
many-body complexity. Powerful approximate methods exist for solving 
these many-body motions. Computational methods in which the operator 
equations are solved numerically have also revealed inner structure of both 
high atomic number elements and the molecules which they form. But 
these developments are beyond the domain of the two-body problem. 
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Reprise 

A long journey spans the distance from the Ptolemaic world view to the 
quantum mechanics of the atom. Two milestones mark turning points on 
that journey: Newton’s inverse-square law which produced the ellipse of Ke- 
pler from a fundamental law of motion and Bohr’s quantum transformation 
of that same inverse-square law to create the atom. The harvest is extraor- 
dinarily rich. We now stand before one of the magnificent vistas of physics 
to which the Shaggy Steed of Physics has carried us: the heavens and the 
elements governed by a universal law of motion and the same symmetries. 
A complete set of isolating invariants formed from the conservation laws 
and completed by the eccentricity invariant for the inverse-square force of 
electricity and gravitation portrays this unity. Its underlying symmetries 
and invariants are those of four-dimensional rotations. From these invari- 
ants flow the fundamental orbit of heavenly bodies on the one hand and 
the fundamental pattern of the elements on the other. 

Quantum mechanics, compelling as it is, can only be an approximation 
to a more complete theory of mechanics yet to be conceived. Since the 
probabilistic character of quantum mechanics arises because of the impos- 
sibility of observing half the invariants of motion exactly, half the invariants 
of classical motion rest upon an ultimately uncertain footing deep at the 
quantum scale. Yet the large-scale world in which we live and move pos- 
sesses all these invariants! How do these uncertain invariants get assigned? 
There is no more telling sign of the present incompleteness of quantum 
theory than that there is no way presently known to bridge the uncertainty 
gap between the quantum and the classical without paradox.* 

We are always open to Ptolemaicism — seeing the simple in needlessly 
complicated ways. The centerpiece of quantum theory, the state function 
(“die Psi funktion” as Einstein always called it, denying it permanent sig- 
nificance beyond its symbolic form), faces us with this vulnerability. The 
central object of the theory is not observable. Rather, it is an intermediate 
construct on the way to the probability density I'Ll 2 which is the observable. 

Are these not Ptolemaic burdens, quantum versions of epicycles and def- 
erents that we bear because we cannot yet see the way out of a needlessly 
obtuse perspective? Ptolemaicism, with its quite credible predictions of 
actual orbits, reminds us that accurate predictions of atomic structure af- 
forded by quantum mechanics cannot alone guide us to the deep truth of 
the theory. Like Greek and medievalist, we too are limited by the vision of 
our era. 

Why, one must finally ask, why this omnipresent, inverse-square force 



* Unique perspectives on the paradoxes of quantum mechanics and an imaginative 
solution to the present incompleteness of the theory based upon gravity are offered 
by Roger Penrose in 300 Years of Gravitation, S. W. Hawking and W. Israel (eds.), 
Cambridge: Cambridge University Press, 1987, 17-48 and The Emperor’s New Mind, 
Oxford: Oxford University Press. 1989, Chap. 6. 
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in the large-scale world that closes orbits, makes electronic states degen- 
erate, and creates elements with rich structure thereby giving the world 
pith and texture? Can there be a more dramatic portrait of the unfold- 
ing of the one into the many than that of charge and mass bound by the 
inverse-square force? For out of such nakedly simple attraction born out of 
a primal bloom there arise, in coalescing complexity, elements, molecules, 
planets, geological structures, atmospheres, and the entire evolution of liv- 
ing creatures — the vast portion of the creation that surrounds us. And 
if the world is splendid, this simple force mediating mass and charge is, 
indeed, a splendid stroke of creation. 

No one has yet found a credible vision of the quantum that naturally 
merges into the classical or dispenses with the unobservable state function. 
But deeper insight into the pervasive nature of the inverse-square force has 
been found. It is Einstein’s vision of general relativity. It is splendid. 




Chapter 6 



The Hidden Unity of Space and Time 



We are approaching the limits of our journey with the two-body problem. 
The mechanics that cast asunder the ancient cosmos has borne us upon this 
journey. Looking back, one finds the static heavenly spheres gave way to 
a universal law sustaining all motion: the universe unfolds along the paths 
of least action of all its matter. The diversity and richness of the heavens 
and the elements are born out of the unity and simplicity of electrons and 
nuclei bound by gravity and electricity. 

For two heavenly bodies the law of motion yields the classical ellipse of 
Kepler — and more: heavenly tori image heavenly motions in a simplicity 
and beauty more revealing than that of the spheres of antiquity. Extended 
into the quantum realm, the same law of motion describes the structure 
of the elements; and the prototypical structure of the elements is found 
to possess the same symmetries as the prototypical structure of the solar 
system. 

The heavens and the elements are mathematically united by the same law 
of motion and the same underlying symmetries. The external beauty of the 
world is accompanied by an interior beauty of invariants and symmetries in 
which the perfection imagined in the heavenly spheres pervades the whole 
of space and time. The music of the spheres and the elements now resounds 
in the four-dimensional rotational symmetries embracing the solar system 
and the atom. 

This vision is only a partial vision of a world layered in mystery. The por- 
trait of the heavens and the elements drawn from the Galilean and Newto- 
nian mechanics of gravity and electricity also shatters when pushed beyond 
its human-scale limits. Our portrayal of the heavens and the elements — like 
those of the past — is only a glimpse of a deeper and more embracing drama 
which continues to unfold. 

In these final chapters the two-body problem reveals its limitations and 
in so doing it opens us to a wider world of physics. We shall first be con- 
fronted with the breakdown of Galilean symmetries and their displacement 
by the even more graceful symmetries of Einsteinian relativity. (Remark- 
ably, the Einsteinian symmetries of space-time turn out to be the same as 
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the symmetries of Galilean two-body motion when the motion is unbound.) 
Then we shall be introduced to the new features of motion which appear 
when more than two bodies are allowed to interact, the rich world of chaos 
and cosmos which appears in many-body motions. 



Einsteinian Symmetry 

Space and time are not quite what we have made of them. The position 
of a point in space, the instant of a point in time — these seem to be the 
bedrock of experience, the absolute ground upon which all descriptions of 
the dynamics of the world must be built. But they are not. They only 
appear to be so for velocities which are small compared to the velocity of 
light. For the full range of velocities, the primal mechanical quantity is not 
position, nor is it time. The fundamental mechanical quantity of the world 
is a velocity , the velocity of light. 

We say “of light,” but it is more than this. It is the velocity with which 
both gravitational and electrical interactions are propagated from particle 
to particle. The electrical force is propagated from particle to particle at 
the speed of light. The gravitational force is propagated from particle to 
particle at the speed of light. The maximum velocity with which the sub- 
atomic forces can be propagated from particle to particle is the speed of 
light. The universe communicates at the speed of light. 

At the dawn of the twentieth century it became clear that the speed of 
light is an invariant over all the events of the universe. The velocity of 
propagation of the gravitational or electrical force is measured as the veloc- 
ity of light in all reference frames, no matter what their velocities. This is 
contrary to the Galilean view of space and time which has shaped our jour- 
ney to the heavens and the elements. In the Galilean world space and time 
are separately homogeneous and isotropic according to the transformation 
law (3.2). Time flows absolutely and forces propagate infinitely fast for all 
observers. 

Motion with velocities of the order of the speed of light is described as 
relativistic motion. At velocities which approach the speed of light, the 
finite speed of propagation of forces becomes noticeable. Whereas space 
and time are absolute in the Galilean world, the speed of light is the true 
absolute. Space and time are the relative notions. They are local properties 
of a particular reference frame. 

Since the speed of light is such a large quantity compared to the human 
scale (approximately 186,000 miles/sec or 300,000 km/sec), the discovery 
that forces propagate at a finite speed did not occur until the late nineteenth 
and early twentieth centuries. If the speed of light were to be absolute over 
all reference frames, something else had to give. Time was this quantity; 
and the idea of time becoming a relative notion peculiar to each observer 
rather than an absolute quantity which ruled all motions proved to be a 
great stumbling block. 
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In the late nineteenth century H. A. Lorentz worked out the transforma- 
tions of space and time which maintain the speed of light constant between 
any two reference frames. (These famous transformations now bear his 
name.) In so doing he found that the time coordinates of each reference 
frame are not the same. They shrink and expand with respect to one 
another depending upon the relative velocity between the two reference 
frames. This result was so confounding that Lorentz called one of the times 
the “absolute time” and the other the “local time.” In reality neither time 
coordinate can claim a status different from the other. Both are on the 
same footing. There is no absolute; there are only local times. Albert Ein- 
stein was the first to truly affirm this conclusion as a profound revelation of 
nature and to foster its development into a watershed theory of twentieth 
century physics. 

The implications for space and time which flow from forces which propa- 
gate at a finite speed may be drawn out in the following way. A gravitational 
or electrical force is initiated. Think of this force as a wave which prop- 
agates at the speed of light c. Let the spatial coordinates X\,X2,X3 mark 
the wave front of the force and let t record the time in this reference frame. 
The wave front of the force propagates a distance ct in the time interval t. 
But this same distance between the initial and final positions of the wave 
front has the magnitude X\ + xf 2 + X3 2 . The quantity 

s 2 = (ct) 2 - X 1 2 - x 2 2 - x 3 2 (6.1) 

must therefore be an invariant. This invariant of space and time which 
follows from the invariance of the velocity of light is known as the Lorentz 
interval. 

The Lorentz interval s marks the interval between successive wave fronts 
of an electromagnetic or gravitational force. Since these fronts move at the 
invariant velocity of light, the Lorentz interval is observed to be identical 
in all reference frames. For any two reference frames (x,t) and (x',t'), the 
Lorentz interval measured in each frame is the same: 

s 2 = (ct) 2 - Or,) 2 = (ct’) 2 - Or') 2 . 

The most elementary symmetries of the physical world are in space and 
time themselves. In familiar three-space, reference frames in which the 
law of motion is invariant must be open to all origins, translational direc- 
tions, and rotational orientations. These are the fundamental symmetries 
of homogeneity and isotropy. 

In Galilean space-time, homogeneity and isotropy are described by the 
ten symmetry parameters a, u, fl, and r which generate the Galilean rel- 
ativity transformations 

x' = x + Sa+ Sut + SS2 x x, 
t! = t + St. 



( 6 . 2 ) 




192 



The Shaggy Steed of Physics 



Space and time appear asymmetrically in Galilean relativity. Time is totally 
severed from the space coordinates in the second of the transformations 
(6.2). It is therefore absolute across all reference frames and plays no role 
in the isotropy of space. Rotational symmetry is only manifested in the 
space coordinates. This aesthetic flaw is directly connected with a physical 
flaw in which forces propagate infinitely fast. 

Where is the logical defect in the Galilean symmetries that underlies their 
experimental and aesthetic flaws? It is in their ambiguous sense of direction. 
As described in Chapter 3, Galilean relativity postulates two sources of 
directional symmetry, one translational described by a uniform velocity 
(such as walking along a straight line) and another rotational described 
by an angular orientation (such as pointing at a star). In hindsight we 
now know there is only one basis for the notion of direction; and it is 
rotational. The direction which arises through the apparent translational 
nature of the velocity vector turns out to be rotational when the velocity 
of light is accepted as an invariant of nature. True translational symmetry 
is restricted to symmetry with respect to origins. This is appropriate; for 
translations have to do with the lack of a preferred origin rather than the 
lack of a preferred direction. 

The flaws in Galilean relativity may be overcome by recognizing the 
hidden unity of space and time: they are bound together in a single four- 
dimensional space-time rather than in a three-dimensional space which 
stands separately from time. Instead of specifying a point in space with a 
three-component position vector which is a function of time, one describes 
events in space-time in terms of a single four- vector. Three components of 
this vector are the spatial components x\,X 2 ,x 3. The fourth (or “zeroth”) 
component, consistent with the dimensions of these spatial components, is 
the signal interval xo = ct. In this four-space the four components of space 
and time constitute the four-vector 



x\ = (x 0 ,x 1 ,x 2 ,x 3 ) = (ct, x). 

(A four-vector is distinguished with Greek suffixes, as in x\, A = 0, 1,2,3, 
while the boldface notation and Latin suffixes, as in x and a are reserved 
for three- vectors.) 

The magnitude of a four- vector is symbolized as x 2 . If it were a vector 
in Euclidean space, its magnitude would obey a Euclidean metric like that 
of Eq. (3.1): 

X — Xq T X\ %2 T X 3 . 

Such a Euclidean magnitude is contrary to the invariance of the Lorentz 
interval (6.1) required by the condition that forces propagate at the speed 
of light. The time component xf 2 must have the opposite sign of the spatial 
components x 2 . Space and time therefore constitute a non-Euclidean space 
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in which the magnitude of a vector is composed from its components by* 

2 2 2 2 2 2 / n q \ 

X = S = Xq — X\ ~ X2 ~ X% . (o.o) 

Contrary to the position vector in the Galilean world, whose magnitude 
x 2 varies with time, the fundamental vector describing position in four- 
space — the “event” of space-time — has a magnitude which is an invariant. 
This metric condition is the evolution of the three-dimensional Euclidean 
metric of space into a four-dimensional metric conditon of space-time. 

The search for the simplest symmetries of space and time is reminis- 
cent of the search of Copernicus and Kepler for a geometry of beauty and 
simplicity which reflected the solar system. The replacement of the Eu- 
clidean metric condition ( 3 . 1 ) by the Lorentz metric condition ( 6 . 3 ) in a 
non-Euclidean four-space is a visionary insight leading to a simplicity and 
beauty comparable to that of the Copernican replacement of the earth by 
the sun as the center of planetary motions. 

The blank canvas of mechanics must be homogeneous and isotropic in 
four-dimensional space-time. Since the magnitude of the four-vector is in- 
variant, the symmetry transformations may only shift the origin or change 
the direction of four- vectors. They may not alter their magnitudes. Trans- 
formations of vectors that change direction without changing magnitude 
are rotations. 

Rotations of space in which one space coordinate is rotated into another 
are familiar enough. But the unification of space and time in a single four- 
space leads to a notion unthinkable in three-space: the rotation of a space 
coordinate into a time coordinate. Because of the non-Euclidean metric 
( 6 . 3 ) of space-time, the rotations of space components into time compo- 
nents differ from Euclidean rotations and are sometimes called ’’pseudo- 
rotations.” Such a space-time rotation differs from a pure spatial rotation 
by having a rotation angle which is imaginary. These four-dimensional 
rotations come about in the following way: 

The symmetry group of relativistic motion is that of translations and 
four-dimensional rotations. In four-space there are six independent planes 
of rotation as described in Chapter 4 . Three of these planes, (x2, £3), 
(x3,xi), and (£1,2:2), contain the rotations among the spatial coordinates. 
The remaining planes, (2:0, 2:1), (2:0, 2:2), and (2:0, 2:3), contain the rota- 
tions between the time and space coordinates. Four-dimensional rotations 
are described by six symmetry parameters which may be arranged as two 
three-vectors as described in Chapter 4 . Rotations in the four-space of 



* There is a freedom in the overall sign of the Lorentz metric condition. One could 
equally well define the Lorentz interval as 




194 



The Shaggy Steed of Physics 



Lorentzian space-time are therefore described by six symmetry parameters 
which reside in two three-vectors and represent rotations in the six planes. 

The subspace of rotations among the three spatial coordinates x\, a; 2 , X 3 
is Euclidean; all components Xi contribute terms xf to the metric with the 
same sign. The rotations in these planes are the same as those in Galilean 
space-time and are described by the three-vector rotational symmetry pa- 
rameter fl. 

The rotations between the space and time coordinates differ from the 
purely spatial rotations because there is a sign difference between Xq 2 = 
( ct ) 2 and any of the components x± 2 , xf 2 , xf 2 in the metric condition. 
The rotations between space and time coordinates are hyperbolic rather 
than Euclidean. Whereas Euclidean rotations turn through real angles and 
are described by trigonometric functions, hyperbolic rotations turn through 
imaginary angles and involve hyperbolic functions. A hyperbolic rotation 
may be obtained from a Euclidean rotation by making the angle of rotation 
imaginary. 

The rotations between the time and space coordinates are also generated 
by a three-vector symmetry parameter. What is it? The velocity symmetry 
parameter u, shorn of its role as a generator of translations in Galilean 
space-time, turns out to be the generator of the rotations between the time 
and space coordinates. The velocity symmetry parameter may be expressed 
more informatively as a rotation angle <f> given in terms of u by 

tanh (f>i = Ui/c. 

For infinitesimal rotations, this expression shows 6 <f> = 5u/c. The complete 
set of symmetry transformations in Lorentzian space-time are described by 
the pair of three- vectors, fl and <fi. The pure spatial rotations are described 
by fl and the imaginary rotations between the time and space coordinates 
(which have come to be known as “boosts”) are described by <f>. 

With directional symmetry completely described by rotations, the two 
cumbersome transformations (6.2) are swept away to be replaced by a single 
symmetry transformation of translations and rotations: 

x '\ = x \ + Sa \ + M-A/bV ( 6 - 4 ) 

Translations are described by the four- vector Sa\ while the four-dimensional 
rotations — known as Lorentz transformations — are described by the four- 
matrix SA\ 

The rotation four-matrix SAx^ possesses only six (rather than sixteen) 
independent components. These six components consist of the six com- 
ponents of two three-vectors which are the symmetry parameters of four- 
dimensional rotations, Sfl and 5(f). The group of transformations of space 
and time is again described by ten symmetry parameters. Four of these 
describe translational symmetry, the components of the four- vector 5a\. 
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Six describe rotational symmetry, the two three-vectors Sfl and 5(f) whose 
components constitute the six independent components of 6A\ 
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One sees that the purely spatial rotations described by <5f? form a three- 
matrix tucked into the lower right-hand corner while the space-time rota- 
tions described by S<p border it.* 

The two kinds of rotations contained within a Lorentz transformation 
may be illustrated in the following manner. If a reference frame x is 
obtained from a frame x\ by a rotation about the X3 axis through angle 
O 3 = 9 , the spatial coordinates X\ and X2 are transformed by the elementary 
rotation 

x\ = Xi cos 9 + x 2 sin 9, 

(6.5) 

X2 = —xi sin 9 + x 2 cos 9 , 

and the coordinates Xq = ct and X3 are unchanged. This is the pure spatial 
rotation part of a Lorentz transformation as shown in Fig. 6-1 fa). 

The hyperbolic rotation (or boost) of a space coordinate into a time 
coordinate is illustrated in Fig. 6-1 (b). Let the four- vector x' x define an 
event in a given reference frame and let the four-vector X\ define the event 
in a reference frame which translates along the X3 axis with velocity U3 = u 
relative to the first frame. The two coordinates x\ and X2 are unchanged 
in transformation between these two frames. Only ct and X3 are changed. 
The hyperbolic rotation of ct into X3 can be obtained by mimicking the 
Euclidean rotation of X\ into X2 but with an imaginary angle 9 — > itp. The 
trigonometric functions are transformed into hyperbolic functions and a 
space-time rotation is given by 



ct' = ct cosh <j> + X3 sinh (f > , 
X3 = ct sinh f> + X3 cosh <f>. 



( 6 . 6 ) 



The hyperbolic functions of the angular parameter <j> may be expressed 
directly in terms of the velocity parameter as 



sinh <f> = 



i/t 



cosh (f) = 



1 



y/1 ~U 2 /C 2 ' y/1 - V?/C 2 ' 

The Lorentz transformation (6.4) can therefore also be written 
ct+(u/c)x 3 , ut + x 3 



ct' = 



y/l — U 2 /c 2 



X3 = 



i/l — u 2 /c 2 



(6.7) 



* A finite rotation in four dimensions is obtained in a manner similar to that in three di- 
mensions. One exponentiates the matrix corresponding to the infinitesimal rotation 
x'^= to obtain the finite rotation x' x = e^ x u-x^. 
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XI 





Figure 6-1. Euclidean and Hyperbolic Rotations. In the Euclidean rotation between two 
space coordinates (a), the coordinates are related to the angle of rotation by tan# = 
X 2 /X 1 . In the hyperbolic rotation between the space and time coordinates (b), the 
coordinates are related to the angle of rotation by tanh <j> = x$ /xq = x^/ct = u%/c. 



These equations show what distressed H. A. Lorentz. According to the first 
of Eqs. (6.7), the time between events At' experienced when one is at rest 
with respect to them is greater than the time At between the same events 
when they are in motion 



At' = At/ y/l — u 2 /c 2 . 



Time passes more slowly in frames in motion relative to those which are 
fixed. 

In the limit u 2 /c 2 <C 1 the Lorentz transformation (6.7) reduces to the 
Galilean transformation (6.2) and the velocity of light disappears: 



x' 3 = X 3 + ut , 



t' = t. 
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Relativistic Action 

The classical action principle for relativistic motion follows that for 
Galilean relativity described in Chapter 3. It consists of the variation of 
the action due to the variations of the path and the variations due to the 
symmetry parameters. The action itself and its Lagrangian are different 
in relativistic motion, thus revealing a profound geometric significance ob- 
scured in Galilean space-time. 

The principle of least action in relativistic motion has a beautiful geo- 
metric form. The motion of a free particle takes place in such a way that 
its path from one event to another through four-dimensional space-time 
is the shortest possible; that is, its Lorentz interval f ds takes a shape in 
which its length is an extremum — a minimum or maximum. Such extremal 
curves are known as geodesics. For example, the geodesics of a sphere are 
the great circles which lie upon its surface. The shortest distance between 
any two points on the surface of a sphere lies upon a great circle which 
intersects them. 

The action principle reaches an epitome in relativistic space-time. It tells 
us that all motion is orchestrated by this simple law: in their movement 
between any two points in space-time particles take the shortest paths 
possible. In a deep sense, this is the only law of motion; for Einstein has 
shown that this principle is also true not only for a free particle but also 
for particles gravitationally and electrically interacting with one another. 
When interaction takes place, the metric of the space in which the path 
exists is no longer the simple Lorentz metric but is shaped by the presence 
of mass and energy. 

The action is in essence the path length: 




The particle mass m and the speed of light c are coefficients which provide 
the proper dimensions. The negative sign is chosen because the interval 
minimizes S. The principle of least action with only path variations (sym- 
metry parameter variations will be considered shortly) takes the form 



^•Spath — 




ds = —me 





( 6 . 8 ) 



Since S = f L dt, the Lagrangian for a free particle in relativistic motion is 
L = dS/dt or 

L = —me ds/dt = —me 2 y/l — x 2 /c 2 . 

The momentum follows as 



p = dL/dx = mxj \J\ — x 2 /c 2 . 



(6.9) 
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The total energy (or Hamiltonian) is given by H = p ■ x — L. The 
Lagrangian may be expressed in terms of the momenta through Eq. (6.9) 
as 

L = —me 2 / y/l + p 2 /m 2 c 2 , 

where p 2 denotes the magnitude of the three- momentum: p 2 = pi 2 +P 2 2 + 
P 3 2 . The Hamiltonian then takes the form 

H = c^/p 2 + m 2 c 2 . (6.10) 

The three components of the momentum and the energy may be gathered 
together as the four-momentum, 

P\ = (po,p)- 

The fourth (or zeroth) component of the momentum turns out to be the 
bearer of the total energy of the particle H = £. This can be seen by 
rearranging Eq. (6.10) in the form 

(£/c) -pi -p 2 -P 3 =mc, 



which corresponds to the metric condition for the four-momentum p\, 



2 2 2 2 2 
PO ~P 1 ~P2 ~P3 = P ■ 



( 6 . 11 ) 



One finds the zeroth component of the four-momentum is fixed by the 
energy and its magnitude by the mass of the particle: 

Po = S/c, p = me. 

These same relationships with Eq. (6.9) show that the three-momentum is 
related to the three-velocity by 



P=(Po/c)x. (6.12) 

The canonical coordinates of motion in Einsteinian relativity are the 
two four-vectors of position and momentum and there exists an intimate 
correspondence between the components and magnitudes of the two. Aside 
from factors of c, the zeroth component Xq = ct of the four-position is the 
time and its canonical mate is the energy po = £ /c. The three-momenta 
p are the canonical mates of the space coordinates x. The magnitude of 
the four-position is the Lorentz interval. It corresponds to the magnitude 
of the four-momentum which is the mass of the particle. 

It is interesting to see the connection between the action as a path 
length in Einsteinian space-time (for which the Lagrangian is essentially 
the speed —meds/dt) and the Galilean action (for which the Lagrangian 
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is L = T — V). In the Galilean limit x 2 <C c 2 , the relativistic Lagrangian 
L = —rru?\J 1 — x 2 /c 2 becomes 

L = mi 2 / 2 — ?nc 2 

and is the same as the Lagrangian L = T — V of a free particle in Galilean 
space-time where T = mi 2 / 2 and the potential energy is V = me 2 . The 
condition that a particle minimize its Lorentz interval therefore reduces to 
the minimization of the Lagrangian L = T — V in Galilean space time. 

When the action (6.8) is varied with respect to the symmetry parameters, 
invariants of the motion emerge. Translational symmetries for both space 
and time are now contained in a four-vector symmetry parameter, Sa\ = 
( cSr,Sa ). The directional symmetries are now purely rotational and are 
contained in the symmetry parameters 5fi and 8<j> = Su/c. 

The relativistic action when subjected to the symmetry variations is 
formally the same as that for the Galilean symmetries (Note 1), 

^symmetries = A P • 6a— AJ • 6fi - AN ■ 6u - A H6t, (6.13) 

however there is a change in the meaning of the invariant N which cor- 
responds to the velocity symmetry parameter u. The conservation laws 
A() = 0 lead to the ten invariants P, J, N, and H. The four-vector 
P\ = J2 a P\ a is the total four-momentum of the system. It consists of 
space and time components P\ = (Po,P). The space components are the 
same as those for Galilean relativity whereas the time component, within 
factors of c, is the total energy 

p o = J2po a = Sic. 

OL 

The angular momentum J is the same as that in Galilean relativity; 
but the mass-center invariant N is different. In Einsteinian relativity it 
is discovered to be more fundamentally the energy-center invariant. The 
center of energy X is defined by 

X = K-^EpOaXa- 

a 

The velocity of a particle is related to its momentum through its energy 
according to Eq. (6.12). This equation may be summed over all particles 
to become P 0 X = cP and then integrated to give 

N = (PX o - P 0 X)/c, (6.14) 



where Xo = Xq = ct. 

The relativistic invariant N is similar to the mass-center invariant N of 
Galilean relativity, except that the energy plays the role of the mass. The 
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Galilean limit occurs when p 2 <C m 2 c 2 in which case P$ = £/c — > me, and 
the energy-center invariant becomes the mass-center invariant 

N = (PX o - P 0 X)/c -► Pt - mX. 



Equivalence of Mass and Energy 

The expression for the Hamiltonian or total energy (6.10) of a parti- 
cle in relativistic motion is surprising in two ways. First, it is irrational. 
The Hamiltonian of a Newtonian particle is a rational function of the mo- 
mentum: H = p 2 /2m. The Hamiltonian of a relativistic particle depends 
upon the momentum through the radical PI = C\J p 2 + m 2 c 2 . The rational 
Hamiltonian of a Newtonian particle occurs in the limit p 2 <C m 2 c 2 for 
which expansion of the above radical gives 

H ss me 2 + p 2 /2m. 

Second, the relativistic Hamiltonian (6.10) introduces one of the most 
unanticipated connections ever to appear in physics. In the reference frame 
in which a free particle of energy H = £ and mass m is at rest (p = 0), one 
can immediately see that the particle possesses an internal energy contained 
by its mass given by a truly amazing relationship first discovered by Albert 
Einstein: 

£ = me 2 . 



The speed of light is invariant across all reference frames. This discovery 
of the abstract geometry of space and time in turn confronts us with an 
implication for the most concrete mechanical objects: mass and energy are 
equivalent. 

The implications multiply. A uranium 235 nucleus is not equal to the 
mass of barium 141, krypton 92, and 2 neutrons, its fission products. That 
tiny difference in mass between the uranium nucleus and its fission products, 
about one-tenth the mass of a hydrogen atom, corresponds to 200 million 
electron-volts of energy. The ground state energy of the hydrogen atom, 
\£ 0 \ = 13.6 eV, is the energy scale of all chemical interactions. The nuclear 
energy scale is therefore ten million times the chemical energy scale. The 
destruction wrought by one pound of mass consumed in a nuclear reaction 
is equivalent to that induced by ten million pounds of chemical explosives. 
Nature opened to us the stunning ramifications of relativistic invariance for 
matter and motion. And with that revelation we built a bomb. 

At this high peak on our journey through mechanics, we encounter the 
sad vista of physics, the mystery in matter comingled with mass destruction 
that is the hallmark of our age. This beautiful revelation of the inner 
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behavior of nature was not affirmed with wonder and love of the natural 
world. It was used for an impious attack on the creation itself. 



The Kepler-Einstein Coincidence 

The relativistic space and time of Einstein possess four-dimensional ro- 
tational symmetry in which the space-time rotations are imaginary. Kepler 
motion in Galilean relativity also possesses four-dimensional rotational sym- 
metry. The symmetries of these two quite different aspects of the physical 
world are the same, save for a flip of sign in the metric, a source of some 
amazement. 

The coincidence can be more clearly seen by examining the Poisson 
bracket algebra of the Lorentz invariants J and N. The Poisson bracket in 
relativistic motion is built upon the four- vectors x\ and p\: 

i = V ( dg £>g df 

l7,5J ^\dx x dpx dxxdpx 

where it is understood that A runs over all four coordinates and the sum is 
over all particles. 

The Poisson bracket (6.15) shows that J and N have the relativistic 
algebra, 

'If — eijkJkt [iV,;, J j\ — Cijk ^ k , [ A'f . Ay ] CijkJk/c , (6.16) 

which should be compared with the nonrelativistic algebra of the Kepler 
symmetries (3.48) in unbound motion for which the eccentrum becomes 
imaginary ( E — > iE): 

[Mi, Mj] = eijkMk, [Ei, Mj] = eijkEk, [Ei,Ej\ = (4.10) 

The Kepler-Einstein coincidence lies in the fact that, aside from the factor 
of 1/c 2 in the third of Eqs. (6.16), the relativistic algebra is the same as 
that of the Kepler invariants M and E of Eqs. (4.10) for free motion with 
M playing the role of J and E playing the role of N. The Poisson bracket 
[Ni, Nj] vanishes identically for Galilean symmetry as shown in Table (3.51). 
However, for the Einsteinian symmetry it cyclically reproduces the angular 
momentum as shown in the third of Eqs. (6.16). Only in the limit c 2 — > oo 
does the relativistic algebra (6.16) give the Galilean result. This is because 
the rotational symmetry generated by velocity boosts which produces the 
invariant N approximates a translational symmetry in the Galilean limit. 

It is important to observe that Eqs. (6.16) do not match the Kepler 
algebra for bound orbits for which the four-dimensional rotations are all 
real and the sign of the [£), Ef] bracket switches. Imaginary rotations enter 
when the motion is free for which the Kepler constant, the eccentrum, and 



(6.15) 
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the angles of rotations involving these coordinates become imaginary. The 
Kepler motion phase space in this case is non-Euclidean as is Einsteinian 
space-time. 

The Kepler-Einstein coincidence does not imply a causal connection be- 
tween relativity theory and Kepler motion. This is made clear both by 
the fact that there is no correspondence for bound orbits and, more impor- 
tantly, by the fact that relativistic Kepler motion, in contrast to Galilean 
motion, does not possess four-dimensional rotational symmetry at all. We 
are going to find shortly that the invariance of the eccentricity is broken 
in relativistic motion and the symmetry algebra (4.10) does not exist. The 
coincidence of the symmetries rather reflects the many ways in which the 
same mathematical forms appear within the diversity of the physical world, 
an instance of what V. I. Arnol’d has called “the mysterious unity of all 
things.”* 



Relativistic Kepler Motion 

As we move onward to describe the relativistic regime with two-body 
motion we shall be rewarded with stunning success followed by inescapable 
failure. In the relativistic crisis of two-body motion the two-body descrip- 
tion of electricity will successfully reveal significant relativistic structure in 
the hydrogen atom. But the description of two-body gravitational motion 
will be flawed from the start. In the relativistic regime electricity and grav- 
ity are revealed to no longer be the same kinds of forces. Both the two-body 
problem for gravity and electricity will ultimately lead to contradiction. 

Begin with a simple measure of the importance of the velocity of light. 
The maximum velocity in bound Kepler motion is the orbital speed of the 
ground state, = M/p,r o = k/M where ?’o = M 2 /hp is the radius of the 
ground state orbit. The significance of the speed of light in classical Kepler 
motion can therefore be summed up in the parameter 

(3 = v^/c = k/Mc, 

which is the ratio of the maximum speed attainable in two body motion to 
the speed of light. 

In quantum Kepler motion the ground state quantities may be con- 
structed from the same classical quantities by replacing the angular mo- 
mentum M by h. The importance of the speed of light in the hydrogen 
atom is summed up in the parameter 

a = k/Tic « 1/137, 



* V. I. Arnol’d, Catastrophe Theory , Third Edition, Berlin: Springer- Verlag, 1992, 
Chap. 16. 
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which is the ratio of the maximum speed of the electron in the hydrogen 
atom to the speed of light. This natural physical constant is known as the 
fine-structure constant. The importance of the speed of light in the Kepler 
motion of the atom is a universal constant of nature, a fixed, small number. 

Nature demands that relativistic effects in the atom be small. On the 
other hand, since (3 need not be a small quantity, nature has no such demand 
on the heavens. To the contrary, relativistic gravitational effects are the 
sine qua non of the large-scale cosmos. The difference between j3 and a, 
like that between h and h, is a reflection of the decisive difference between 
the heavens and the elements. 

A description of relativistic Kepler motion is formed from the union of 
the relativistic Hamiltonian for a free particle in the center of mass frame of 
motion and the inverse-square force potential V = —k/r thereby creating 
the Hamiltonian and Hamilton- Jacobi equation for the relative motion: 

Ti = c\/p 2 + p 2 c 2 — k/r = £. (6-17) 

The corresponding quantum-mechanical energy operator for relativistic mo- 
tion is obtained by substituting pi = — Ti 2 d 2 /dxt 2 for the squared momen- 
tum p 2 in Eq. (6.17): 

Tl = c\/~‘h 2 d 2 /dx i 2 + p 2 c 2 — k/r. (6.18) 

These relativistic Kepler Hamiltonians embody a naked contradiction: mass 
and momenta are Lorentz invariant but the potential V = — k/r is not. 

The irrational operator ^/— hi 2 d 2 /dx i 2 + p 2 c 2 in Eq. (6.18) plunges rela- 
tivistic quantum motion into ambiguity. The artistic openness of quantum 
mechanics is again before us as it was before P. A. M. Dirac who first 
puzzled over the operator (6.18). 

What is one to make of the square-root of an energy operator with both 
positive and negative energy states? The ambiguity of sign with which the 
square-root presents us became an imaginative opening for Dirac. After a 
difficult struggle, with several false steps, he ultimately interpreted the two 
possible signs of the square-root as signatures of two different particles: the 
electron and its antiparticle, the positron. The particle and its antiparticle 
are identical in mass but opposite in charge. 

Dirac sought an equivalent set of operators that reproduce Eq. (6.18) 
but which were rational and linear. He found that such a set of operators 
could be conceived if they were four-by-four matrices obeying simple com- 
mutation rules. Moreover, the wave function T to which the operators are 
applied should be conceived as a four- valued quantity, consistent with the 
four dimensions of space-time. Finally, the Dirac operators which imply 
Eq. (6.18) necessarily contain internal states which represent an internal 
angular momentum of the particle: the spin. However, these spin states 
possess only two eigenvalues, ±Ti/2. 
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In a brilliant stroke of disambiguation of the square-root of an operator 
Dirac was led to the discovery of antiparticles and the necessity of spin — 
both pillars of the microscopic world of elementary particles. The four 
components of the wave function can be thought of as representing two 
distinct particles — electron and positron — each with two spin components 
for a total of four components. 

Discovery is an erratic and perplexing business. Dirac’s ingenious rev- 
elation of the four-valued wave function upon which relativistic quantum 
operators act was so stunning that it could not be immediately compre- 
hended. Two of the waves are associated with positive energies; but two 
correspond to negative energies. Both Dirac and his colleagues working on 
quantum mechanics in the late 1920’s were initially devastated by the ap- 
pearance of the negative energy states which they felt marred this otherwise 
beautiful theory. Dirac’s first reaction was to declare them of no physical 
significance. Werner Heisenberg wrote Wolfgang Pauli in 1928 that “the 
saddest chapter of modern physics is and remains the Dirac theory.” 

Then, bit by bit, Dirac accepted his new child completely. He embraced 
the reality of the negative energy states which the mathematics proclaimed. 
But the struggle was not over. Dirac proposed that these negative energy 
states were protons (protons at that time were in want of theoretical ex- 
planation and the idea of whole new sets of antiparticles in nature was 
still too revolutionary). Dirac’s proton proposal also blighted the theory; 
for the symmetry of the wave function requires that the negative energy 
states have the same mass as the positive energy states and the proton 
was empirically known to be a thousand times more massive than the elec- 
tron. Finally, reassured by Carl Anderson’s experimental measurements of 
uniquely new particles with the electron’s mass but opposite charge, the 
electron’s antiparticle — the positron — was accepted. The physics commu- 
nity ushered in the inevitability of antiparticles as a deep aspect of nature. 

Let us now draw out the orbits of relativistic, two-body motion. The 
relative energy Ti and the angular momentum M persist as invariants in 
the relativistic regime; hence the pair of vector Casimir sets 






continue to be appropriate sets of invariants in the relativistic regime and 
the motion will separate in their splrerical-polar symmetry coordinates. But 
the third Casimir set contributed by the eccentricity invariant no longer 
exists because the eccentricity is no longer an invariant in the relativistic 
case and the motion no longer separates in parabolic coordinates. The 
portrait may only be drawn in spherical-polar coordinates. 

Write p 2 = Pr + M 2 /r 2 just as in nonrelativistic two-body motion. The 
Hamilton- Jacobi equation (6.17) can then be written as 

(£ + k/r ) 2 - c 2 {p 2 r + M 2 /r 2 + p 2 c 2 ) = 0 (6.19) 
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and may be compared with Eq. (4.33), its nonrelativistic counterpart. Rel- 
ativistic motion forces us to rethink our notion of the total energy. It tells 
us that the energy of material bodies is enormous, even if their kinetic en- 
ergy is slight; for the leading contribution to the energy is the rest energy 
pc 2 . The formerly zero level of energy in Galilean relativity is now pc 2 , the 
rest energy of the relative motion. Examination of Eq. (6.19) reveals that 
for £ < pc 2 the motion is bound, librating in the radial direction between 
the points r mm and r max (the points where p r = 0). There is a minimum 
or ground state energy £q given from Eq. (6.19) by 

So = V 1 ~ P 2 tic 2 = 2 ^ ( k 2 p/2M 2 ) , 

for which the orbit is a circle of radius 

r 0 = y/1 ~(3 2 ( M 2 /kp ) . 

These results should be compared with the nonrelativistic ground state 
energy and orbit radius given by Eqs. (4.47) and (4.48) which are So = 
— k 2 p/2M 2 and r 0 = M 2 /kp. For /3 <C 1, the expression for the ground 
state energy may be expanded as 

£ 0 = M c 2 -(l^i/3 2 + ...) (k 2 p/2M 2 ) 

and, with the rest energy accounted for, reduces to Eqs. (4.47) in the limit 
/3 — > 0. It can be seen that relativistic effects shift the ground-state energy 
level upward and contract the radius of the ground state orbit. 

The total action is S(r,<j),t ) = — St + S r (r) + S^(cj)). The angular mo- 
mentum P'f, = M is an invariant just as in nonrelativistic two-body motion. 
The form of the polar action 



5^(0) = M<f> 

is also unchanged. The radial momentum is taken from Eq. (6.19) to con- 
struct the relativistic radial action S r (r) = J p r dr : 



S. r (r) = J \J {S + k/r) 2 / pc 2 — M 2 /r 2 — p 2 c 2 dr. 

The trajectory equation dS/dM = dS r /dM + <f> = const leads to 
f M dr 



r 2 \J (S + k/r) /c 2 — M 2 /r 2 — p 2 c 2 



These trajectories may be compared with their nonrelativistic counterparts 
in Eqs. (4.50). The integral is of similar structure to that of Eq. (4.50) 
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involving a radical of inverse first and second powers of r. It is therefore 
expressible in trigonometric functions and the orbit is the precessing ellipse, 



r(</,) 1 + ecos[y / (l — /3 2 )</>] ’ 



(6.20) 



shown in Fig. 6-2. 

The libration points have a slightly different dependence on the invari- 
ants than in nonrelativistic motion. This can be seen by comparing the 
relativistic orbit (6.20) with the nonrelativistic orbit (4.52). In the non- 
relativistic case the counterpart of the latus rectum r* is ro = M 2 /kg in 
Eq. (4.52) and depends only upon the angular momentum; but the latus 
rectum of the relativistic orbit also depends upon the energy: 



So 

r * = r'°- 



The pericentron and apocentron are given in terms of the latus rectum and 
eccentricity by 



Cmin r*/(l + e), v max — r*/(l e) , 

and the eccentricity has the relativistic form 

2 (gff [ _ S 2 — (gc 2 ) 2 ' 

S 2 [ S 2 - (gc 2 ) 2 \ ’ 

where So is the relativistic ground state energy. In the limit ft <C 1, the 
ground state quantities, libration points, and eccentricity are identical to 
those for nonrelativistic motion when the rest energy level gc 2 is accounted 
for.* 

The precessing ellipse (6.20) is not generally a closed orbit. It is an 
ellipse which rotates about the focus, forming an unending succession of 
rosettes for which the path is not retraced. When <j> completes a cycle of 
27T, the argument of cos[ v / (l — /3 2 )</>] completes a cycle of y(l — /3 2 )27 t. The 
precession of the angle for each cycle is 

A</> = [1 - V(1 - P 2 )}^- 

* For (3 <C 1, both E and Eq are dominated by the rest energy: E = fie 2 + • • •. The 
square of E has the expansion 

E 2 = (/xc 2 ) 2 + 2 fic 2 (E — fic 2 ) + • • • , 

with a similar expression for Eq. Hence, as (3 — > 0 one finds r* — > ro and 

e 2 x _ (£ - pc 2 ) 

{£ 0 - pc 2 )' 



which should be compared with the second of Eqs. (4.48). 




6. The Hidden Unity of Space and Time 



207 





Figure 6-2. The Precessing Elliptical Orbit of Relativistic Motion. The eccentricity 
vector, drawn from the focus to the center (a), is no longer invariant as in nonrelativistic 
motion. It spins about the focus of the ellipse at the precession rate A (ft/T where T is 
the period of the orbit. The orbit is generally open (b); as time passes, it densely fills 
the annulus. The precession for each cycle is A (f> = [1 — y/(l — /3 2 )] 2n or for /3 2 <C 1, 
A cf) = 7 r/3 2 . 

If the orbit is to close upon itself, the precession angle A</> must be a rational 
fraction of 2n [A0 = (l/m) 2n where /, m are integers]. This may only occur 
for special values of (3. For (3 2 <C 1, the precession angle is approximately 
given by 

A<f> = tt/3 2 . (6.21) 

The eccentricity vector, the hidden invariant of nonrelativistic, two-body 
motion, is no longer an invariant. Consistent with an open orbit, there is no 
final isolating invariant for relativistic, two-body motion. The eccentricity 
vector, anchored to the ellipse by tip at the center and by tail at the focus, 
now spins at the precession rate A (j>/T where T is the period of the orbit. 

The open orbit of a precessing ellipse is the classical manifestation of 
the breaking of the closed-orbit degeneracy by the Einsteinian symmetry of 
space-time. This degeneracy should also be relativistically broken on the 
quantum level resulting in uniquely different eigenvalues of the energy for 
each state of angular momentum. If the degeneracy were intact, all states 
of angular momentum would possess the same eigenvalue of the energy. 

Relativistic, two-body, quantum motion described by the Hamiltonian 
operator (6.18) or, more precisely, Dirac’s version of it, reveals that the de- 
generate energy levels of the hydrogen atom are indeed broken by relativistic 
effects. A given energy level is split into a bundle of nearly identical levels 
as shown in Fig. 6-3. Since these split levels result from relativistic effects, 
they must be dependent on the parameter summarizing those effects, the 
fine-structure constant a. The split energy levels are separated by gaps pro- 
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portional to a 2 . The split levels differ from one another by small amounts 
visible in the spectral pattern of radiation emitted by the atoms: about 
a 2 ~ 1/10,000 of the principal energy level £ = (—k 2 p,/2Ti 2 )/n 2 . These 
finely split levels correspond to different values of the angular momentum. 
The existence of different energy levels for each angular momentum quan- 
tum number l is the sign that the inverse-square force degeneracy of the 
hydrogen atom is broken by relativistic effects. 

Arnold Sommerfeld, at a time in which a rigorous formulation of quan- 
tum mechanics was not yet fully born, extended the idea of Bohr that 
classical motions described by trajectories could be transformed into quan- 
tum motions by requiring that the trajectory support an integral number 
of particle waves. Sommerfeld generalized Bohr’s idea by requiring that the 
actions generated in one period = j> pidqt be integral multiples of h. In 
this way the actions of classical motions could be directly transformed into 
quantum-mechanical eigenvalues. 



Energy 



Nonrelativistic Relativistic 

(Bohr, Pauli) (Sommerfeld, Dirac) 



n = 



n = 2 




n = 3; l = 2; j = 5/2 
n = 3; l = 1,2; j = 3/2 
n = 3; l = 0, 1; j = 1/2 

n = 2; l = 1; j = 3/2 
n = 2; l = 0,1; j = 1/2 




Figure 6-3. Relativistic Effects on Hydrogen Atom Energy Levels. To the left are the 
nonrelativistic energy levels postulated by Bohr and confirmed by Pauli. The split energy 
level shifts due to relativistic effects yielded by the Sommerfeld and Dirac theories are 
exaggerated by the factor 1/a 2 . The quantum number j which is the sum of the spin 
and orbital angular momentum quantum numbers is related to the quantum number l 
as j = l d= 1/2, except for l = 0 when j = 1/2. 
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For the relativistic, two-body problem whose solution is a precessing 
ellipse, Sommerfeld’s rule yields the energy levels 

1 + -0-f)l' 

n \j 4n ) 

where j is the angular momentum quantum number. 

Sommerfeld’s calculation was surely inspired. The solution of the Dirac 
equation for the hydrogen atom, which inherently introduces spin, confirm 
Sommerfeld’s result which knew nothing of spin. They show that the total 
angular momentum J of the atom is no longer just the orbital angular 
momentum M. It is now given by 

J = M + S 

and includes the intrinsic spin angular momentum of the electron S. The 
eigenvalues of the spin S are S = ±fi / 2 so the total angular momentum 
eigenvalues are j = l ft 1/2 for all n > 1 and j = 1/2 for n = 1. The only 
appearance of spin in the Sommerfeld calculation is in the interpretation of 
the quantum number j. 

From an uncertain foundation in classical mechanics ignorant of spin, 
Sommerfeld built a structure to the exact result yielded by the Dirac equa- 
tion, an equation built upon spin. This happy coincidence rests delicately 
upon a cancellation of energy shifts due to the coupling of the electron’s 
internal spin with its orbital motion on the one hand and spin-spin cou- 
pling contained within the Dirac equation on the other. The Sommerfeld 
solution emerges as the classical survivor of quantum cancellations. 

There are further splittings of the energy levels of the hydrogen atom 
unaccounted for in the Dirac equation which are proportional to higher 
powers of the fine-structure constant. These effects as well as the inter- 
actions of charged particles at high energy cannot be described by two- 
body interactions. Ultimately, two-body motion becomes a contradiction 
for high-energy interactions of electrons and positrons at relativistic speeds. 



(Ar/z/2 Ti 2 ) 



The Limits of Two-Body Motion 

Relativistic, two-body motion illuminates much of the fine-structure of 
the hydrogen atom. Its success is reaped from the small value of the fine- 
structure constant. Let us now turn to the heavens and the gravitational 
two-body problem to see if the relativistic two-body problem illuminates 
and explains fine-structure in the heavens. 

Mercury, the smallest planet, exhibits significant precession of its el- 
liptical orbit. Most of this precession can be explained by nonrelativistic 
perturbations induced by Mercury’s giant companions, Jupiter and Saturn. 
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But there is a stubborn discrepancy not accounted for by these planetary 
perturbations. The measured rate of precession of Mercury’s ellipse is 574 
sec of arc per century. Detailed nonrelativistic calculations of the pertur- 
bations of Mercury by the giant planets give a precession rate of 531 sec 
of arc per century. Somewhat over 40 sec of arc are missing and a search 
for those 40 sec of arc progressed through most of the late nineteenth and 
early twentieth centuries. 

Does relativistic, two-body motion described by the Hamiltonian (6.17) 
have that additional 40 sec of arc? It does not. When one puts the num- 
bers for Mercury’s orbit into the precession, Eq. (6.21), only about 1/6 
the required deficit is to be had. Relativistic, two-body motion yields the 
right dynamical structure but the wrong precession rate. Though its quan- 
tum counterpart well represents much of the fine electrical structure of the 
atom, the relativistic, two-body problem is an unreliable guide to the fine 
gravitational structure of the heavens. The contradiction contained in the 
two-body relativistic Hamiltonian (6.17) is a fundamental contradiction for 
gravity. 

The relativistic two-body problem for gravity, in contrast to it’s electri- 
cal counterpart, fails immediately as the general relativity theory of Ein- 
stein reveals. In the relativistic regime gravity and electricity are no longer 
structurally identical kinds of forces. Though both become inverse-square 
forces in the nonrelativistic limit, the electrical force ultimately has a vector 
character while the gravitational force has a tensor character, that is, its 
structure is like that of the product of two vectors. 

Gravity requires a deepening of the idea of Lorentz invariance. Albert 
Einstein beautifully imagined such an extension of Lorentz invariance in 
his theory of general relativity in which gravity appears as the geometric 
response of space and time themselves to the presence of mass and energy. 
Indeed, the prediction of general relativity for the precession of the ellipse 
in the limit /3 2 <C 1 is 

A cj) = 6 tt(3 2 , 

and is to be compared with Eq. (6.21). Einstein’s general relativity re- 
sult yields an uncanny 43 sec of arc for the relativistic contribution to the 
precession of Mercury’s ellipse each century. 

The relativistic crisis of two-body motion is inescapable. The finite speed 
of propagation of forces and two-body motion inherently contradict one 
another. When one body suffers a change in position or velocity it transfers 
energy and momentum which cannot reach a second body until after a finite 
interval — the Lorentz interval. The energy and momentum have left the 
first body but they have yet to reach the second. Where do they reside? 
In additional bodies which are the bearers of the force. The energy and 
momentum which have left the first body but have yet to reach the second 
body are carried by force-bearing particles. 

The particle which is the bearer of the force in the case of the electrical 
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force is the photon. The force-bearing particle in the case of the grav- 
itational force is the graviton. In the particle representation of a force a 
particle which interacts gravitationally with another emits a graviton which 
propagates at the speed of light to the other where it is absorbed. A sim- 
ilar picture exists for the interaction of two charged particles such as the 
proton in the atomic nucleus and an electron where the particle emitted 
and absorbed is the photon. For the motion of an electron about a charged 
nucleus the electrical force can be understood as a steady stream of virtual 
photons which are being continuously emitted and absorbed by the electron 
and the charged nucleus. 



Reprise 

Relativistic motion always involves more than two bodies. In the sim- 
plest motion of two charged or gravitating bodies a third entity — the force 
bearing particle — must necessarily be considered under relativistic condi- 
tions. The minimum number of bodies involved might seem to be three — 
two charged particles and one photon in the case of electricity or two mass 
particles and one graviton in the case of gravity. But this is not the case. 
Streams of unlimited numbers of force-bearing particles are required to 
represent the persistent nature of force. All interactions involve infinitely 
many photons or gravitons. Both the hydrogen atom and relativistically 
orbiting heavenly bodies are infinitely many-body motions. The split levels 
of Fig. 6-3 in the exact theory of quantum electrodynamics (which goes well 
beyond the Sommerfeld and Dirac theories) are actually split into even finer 
and finer sublevels. Each line is again another bundle of infinitely many 
fine lines. 

At relativistic speeds quantum wave functions represent both particles 
and their antiparticles. At high particle kinetic energies p 2 /2m ss me 2 , 
where m is the mass of a fundamental particle such as an electron, a 
plethora of particles are spontaneously created out of the vacuum. This 
is because the vacuum is not an abyss of nonexistence. Rather, it is an en- 
ergetic sea of elementary particles and antiparticles spontaneously created 
and annihilated in a dynamical equilibrium. 

A high-energy particle projected into the vacuum, the primordial sea 
of particles and antiparticles, excites a rich variety of interactions which 
bring forth an array of elementary particles. The relativistic motion of all 
elementary particles is, from their source in the vacuum, a rich and complex 
many-body motion. 




Chapter 7 



The Manifold Universe 



Relativistic speeds turn motion into an inherently many-body problem. 
But most motion at the human and planetary scale is not relativistic and 
therefore not inherently many-body. Solar system motions of macroscopic 
bodies including the atmosphere and oceans of a planet like the earth are 
well described nonrelativistically according to the mechanics of Galilean 
relativity. At the scale of the atom, the orbital speed of an electron is only 
a small fraction of the speed of light. The states of the hydrogen atom are 
well predicted by the nonrelativistic laws of quantum motion. 

Yet even in this Galilean limit the isolated interaction of two bodies is 
an idealization, closely approximating many motions — including that of our 
planetary system and the hydrogen atom— but far removed from a descrip- 
tion of others — such as the atmosphere and oceans of the earth and the 
complex molecules of life. Exact two-body motions are only an approxi- 
mation because the world consists of more than two bodies. The fluids of 
the earth’s oceans and atmosphere, formed of enormous numbers of atoms 
and molecules interacting with one another, sustain motion incomparably 
different from that of two bodies. Even on the level of Galilean mechanics 
two-body motion can carry us no farther. 

The passage beyond the two-body problem to the many-body problem 
opens up qualitatively new features of motion. These features are inherent 
in the Hamiltonian mechanics flowing from the principle of least action; but 
it is impossible for them to appear in the motion of only two bodies bound 
by electricity and gravity. When more than two bodies interact classically 
they develop resonances; and resonances inherently harbor instabilities. 

A resonance can be understood in the following way. A many-body 
motion may be regarded as a set of independent two-body motions which 
are allowed to interact. The integrable two-body motions, viewed in angle- 
action coordinates, consist of a set of independent oscillating modes, each 
oscillator possessing a characteristic frequency. In integrable motion the 
oscillators are independent and their amplitudes neither shrink nor grow; 
they are the invariants I of the motion. 

When the independent modes are allowed to interact, the oscillators per- 
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turb one another and their amplitudes and frequencies change. Moreover, if 
one of the modes perturbs another at the same frequency and phase, a reso- 
nance develops in which a mode grows in amplitude much in the manner in 
which a swing is pumped in amplitude by pushing at the same frequency as 
the swing frequency. The amplitude of a resonant oscillator grows without 
bound: it is said to be unstable. 

Many-body motion is rich in interacting oscillator modes which are free 
to closely approach one another in frequency thereby exciting a resonance, 
a situation impossible in the motion of only two bodies. An unstable reso- 
nance persists until it becomes strong enough to change either its frequency 
or the frequency of the perturbation thereby “detuning” the instability. 

Resonances and their accompanying instability manifest themselves in 
phase space by creating a chaotic phase flow. The streamlines of a flow 
with unstable resonances form a complex web with tangled regions of dis- 
order. This tangled disorder in the streamlines sends the trajectories em- 
anating from neighboring points off on wildly different directions. Chaotic 
phase flows are therefore said to exhibit “sensitive dependence on initial 
conditions.” In a chaotic region of phase space neighboring points, consid- 
ered as initial conditions, exist which, no matter how close to one another 
they may be brought, belong to trajectories that diverge by uncontrollably 
large amounts. Resonances, instability, and sensitive dependence on ini- 
tial conditions, are all manifestations of the same underlying phenomena in 
many-body motion perhaps best captured by the single word “chaos.” 

One can gain some understanding of a chaotic motion by placing it in 
context with motion at the opposite end of the spectrum — “cosmic” motion 
also known as regular motion. The most cosmic motion is one so highly 
structured by symmetry that it is both integrable and fully degenerate (it is 
separable and all its separable parts are in synchrony). Such is the motion 
of two celestial bodies. In analogy to the flow of a fluid, the orderly flow 
of two celestial bodies is laminar. Its streamlines are closed on a smooth 
torus. They exhibit regularity and high degree of order. 

As one proceeds along the spectrum to motions of greater disorder, one 
next encounters quasiperiodic motions. These motions are integrable but 
nondegenerate. Since they are no longer in synchrony, they give the appear- 
ance of being chaotic; but their chaos is illusory because integrable motions 
possess symmetry sufficient to prevent the separate modes from interacting. 
Quasiperiodic motion cannot be resonantly excited. The apparent irregu- 
larity arises solely from the lack of synchrony between the various modes. 
The flows of such motions appear not to exhibit regularity (they densely 
fill the torus upon which they lie); but their streamlines have a highly or- 
dered structure (they lie upon an invariant torus) and they do not exhibit 
sensitive dependence on initial conditions. They are therefore not chaotic. 

Finally one encounters motion which is nonintegrable and hence cannot 
even be quasiperiodic. The symmetries of these motions are so broken they 
do not inhibit the separate coordinate modes from interacting. The motions 
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are therefore subject to resonances. Their flows are not only irregular; 
the orderly patterns of the streamlines typical of integrable motion are 
lost. The streamlines, no longer confined to noninteracting tori, are tangled 
in a complex web, and the flow exhibits sensitive dependence on initial 
conditions. These are the essential characteristics of chaotic motion. 

Contrary to the laminar flow of two celestial bodies, the flow of noninte- 
grable motion is generally turbulent like a storm-driven ocean. Two-body 
motion with inverse-square forces cannot be chaotic. But enter a third or 
more bodies and the motion is open to chaos. 



Beyond Two Bodies: Chaos 

The final encounter which two-body motion offers us is a glimpse into 
the rich complexity of mauy-body motion. Any inquiry into the nature of 
many-body motion begins with a sobering theorem formulated at the end 
of the nineteenth century by Henri Poincare and others which declares that 
all many-body motions from the three-body problem on are not integrable. 

Poincare found that the flows of many-body motions cannot be repre- 
sented by transcendental functions (infinite series of algebraic polynomials), 
the most general functions of classical mathematical analysis. He showed 
that these series diverge for many-body motions. Although the trajectory 
flow cannot be represented as a transcendental function, it can be numer- 
ically computed. But even here deep questions about the precision and 
proper interpretation of the computed flow arise because of the sensitive 
dependence on initial conditions. 

Although the many-body problem is not integrable, Poincare discovered 
that the most significant properties of many-body motion could be revealed 
as perturbations of integrable two-body motions. In fact, most of the crucial 
and interesting properties of many-body motions are those which are close 
to integrable motions. These perturbed motions are called near-integrable 
motions. 

An example of a near-integrable motion is that of our own solar system. 
Each planet and the sun, considered in isolation, constitute an integrable 
two-body motion. Since the sun is so much more massive than any single 
planet, the effects of the planets on one another’s two-body interaction with 
the sun may be regarded as a small perturbation. The resulting motion is 
a near-integrable motion. 

A second example is the hydrogen atom in the presence of a magnetic 
field. The electron not only interacts with the nucleus; it also interacts 
with the magnetic field. This is a many-body interaction in which the 
magnetic field represents the effects of numerous electrons flowing in the 
magnet circuits. Rather than consider the hydrogen electron interacting 
with each of these electrons, one considers the magnet electrons organized 
into an ensemble which generates the field. The interaction is then reduced 
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to the interaction between the magnetic field, the hydrogen atom electron, 
and the nucleus. The motion may again be represented as a near-integrable 
motion about the integrable two-body states of the hydrogen atom. 

Viewed in phase space, one has a flow upon the invariant tori of the 
integrable two-body motions. These are described by the action variables 
J = ( Ji, J2, • • • , J s ) and their canonical mates, the angle variables a — 
(oq, «2j • • • , ct s ). When allowed to interact as many-bodies, the actions J 
are no longer invariant and the tori are perturbed by one another’s presence. 

The action S(a, J ) is the ultimate description of near-integrable motion. 
If one knows the action, one can find all the other dynamical variables. 
One begins with a known integrable motion with Hamiltonian Hq(J) and 
known action So = ctjJj. It is to be the basis of an approximation of the 
motion whose Hamiltonian differs from it by a small perturbation H'(a, J): 

H(a, J) = H 0 {J) + H'(a,J). 

The action of the many-body motion is 

S(a , J) = a.j Jj + S' (a, J), 

where J are new action coordinates (which differ from J since the J are 
no longer invariant after the perturbation is made) and S' {a, J ) is the 
response of the action to the Hamiltonian perturbation H'(a, J). Since the 
Hamiltonian is specified by the actual configuration of the many-bodies, 
the perturbation Hamiltonian H'(a, J) is a known function. The question 
is therefore this: what is the action perturbation S' (a, J ) induced by the 
Hamiltonian perturbation H'(a, J)? 

Poincare* found that one could obtain the crucial answers to this ques- 
tion in a powerfully generic manner. The actual flow clearly depends upon 
the particular form of the Hamiltonian, that is, the number of bodies and 
the particular form of the force law through which they interact. But it 
is a result of no small significance that the generic properties of the mo- 
tion are independent of the particular form of the Hamiltonian. Poincare’s 
method which has come to be known as canonical perturbation theory is 
summarized in Note 8. 

Poincare makes a near-integrable motion integrable by constructing a 
canonical transformation which takes the motion to new coordinates (d, J) 
in which the new Hamiltonian is a function only of the action coordinates, 
H = H{J ) only. This is accomplished by averaging the Hamiltonian over 
the angle variables (Note 8). Averaging over the angle variables is the 



* Poincare, H., Les Methods nouvelles de la mechanicque celeste, vols. 1-3, Gauthiers- 
Villars, Paris (1892-99); also available as a reprint from Dover Books, New York (1957). 
Poincare’s method has come to be formalized as canonical perturbation theory and is 
presented in standard works, for example, Chapter 2 of A. J. Lichtenberg and M. A. 
Lieberman, Regular and Chaotic Dynamics, New York: Springer- Verlag, 1992. 
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key to the solution of the motion which is otherwise nonintegrable, a pro- 
cedure whose justification, if not validity, rests upon the smallness of the 
perturbation. 

The generating function of this canonical transformation is the action 
S(a, J). It consists of the identity transformation a Mi and a perturbation 
induced by the many-body interaction S'(a,J). The Hamiltonian under 
the averaging procedure is mandated to have the form 

H(J) = H 0 + (H'(a,J)), 

where ( ) indicates an average over all the angle coordinates. This is the 
crucial step which makes near-integrable motion “integrable.” The Hamil- 
tonian may be expanded in a Taylor’s series in the deviations of the new 
actions from the old, J' = J — J , 

H(J)KH 0 +-^-J' + H'{a,J). 

Since lo = dH 0 /dJ and J ’ = dS' /da according to the first of Eqs. (3.30), 
comparison of these two expressions for H(J) reveals that the perturbed 
action S' is directly related to the Hamiltonian deviation from the average 

AH' (a, J) = H'(a, J) - (. H'(a , J)) 
as 

cj^S'(a, J) = -AH' (a, J). (7.1) 

da 

To this order of approximation it is permissible to replace a by a in S' (a, J) 
and H'(a, J). [A more detailed exposition of the steps leading to Eq. (7.1) 
may be found in Note 8.] 

For s pairs of conjugate position and momentum coordinates, the an- 
gular dependence of the action perturbations may be represented as an 
s-fold Fourier series in the angle coordinates with Fourier coefficients which 
depend upon the action coordinates 

S' (a, J) = 53 S' m (J)e i{&imi+a2m2+ ' +Slsrns) . 

m 



The quantities m = (mi, m^, ■ ■ • , m s ) take all the positive and negative 
values of the integers. 

The deviation of the Hamiltonian perturbation from the average has a 
similar Fourier representation, 

AH' (a, J) = AH' m (J)e i{aimi+&2m2+ - +Sl ° m °\ 
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By substituting the Fourier expansions for S'(a,J) and A H'(a,J) into 
Eq. (7.1), one finds the action for near-integrable many-body motion is 

AH' ( J')e i (“i m i+52m 2 H b a 3 m B ) 

. (7.2) 

rnioji + to 2W2 + • • • + m s u> s 



S(a, J) — cxjjj + 'y ' 



The action (7.2) describes the behavior of many-body motions quite 
acceptably for a wide range of initial conditions (which fix the s invariants 
J), for perturbations which are not too large, and over time scales which 
are not too long. It does nicely for the small perturbations of our planetary 
system due to the mutual interactions of the planets. 

To a first approximation, one may describe the solar system as a set of 
integrable two-body motions, one for each planet and the sun as described 
in Chapter 4. In the next approximation, the mutual interaction of the 
planets is included as the small perturbation. Individual angular momenta 
and eccentricities of the planets cease to be invariant in this approximation; 
however, the sum of the eccentricity vectors is again an invariant because 
of the averaging of Poincare’s method over the angle variables of all the 
planets. The sum of the angular momenta of the planets remains invariant 
because the total angular momentum is always invariant. The solar system 
as a whole therefore possesses invariant angular momentum and eccentricity 
vectors. 

The individual Keplerian orbits of each planet are modified in the follow- 
ing way. The major axis undergoes small oscillations about its two-body 
value. The eccentricity vector not only undergoes small oscillations about 
its two-body value; it is also set slowly spinning so that each Keplerian orbit 
acquires a precession. This effect is reflected in the time a planet spends 
in the vicinity of the sun and can induce effects on the planet’s climate 
with the same frequency signature. The period of precession of the earth’s 
eccentricity vector is of the order of ten thousand years. The earth’s ice 
ages are driven in large measure by the precession of its eccentricity vector. 

These many-body effects are not qualitatively different from the kinds 
of motions characteristic of two bodies. But there are other kinds of many- 
body motion contained within Eq. (7.2) which are radically different from 
the oscillating and precessing trajectories of the kind we have seen thus 
far. There exist motions for which Eq. (7.2) predicts a catastrophic phe- 
nomenon. The Fourier amplitudes of the action have the denominator 
TOiWi + TO 2 W 2 + • • • + m s u) s where the u>i = dH 0 /dJi are the natural fre- 
quencies of the unperturbed motion and the nil may take the values of all 
the positive and negative integers. This denominator may vanish: 



m\u)\ + to 2 w 2 + • • • + m. s LO s = 0 . 



(7.3) 



When it does, a resonance has appeared. 

Since the mrii are integers, the denominator (7.3) cannot vanish unless 
the motion is degenerate, that is, the u>i are rationally commensurate, as 
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they indeed are for both pure Kepler and Hooke motion.* Resonances pre- 
vent series expansions like Eq. (7.2) from converging and hence prevent 
the representation of many-body motion in transcendental functions. Al- 
though the elementary perturbation procedure utilized above cannot be 
used to calculate the motion at resonance, resonance is not an artifact of 
the perturbation procedure. It is a deep-seated aspect of many-body mo- 
tion revealed by more powerful secular perturbation theory and by direct 
numerical calculation of the trajectories using the Hamilton equations. 



A View of Chaos in Phase Space 

Integrable motion exists in phase space as flow upon tori, one torus 
for each integrable mode. What happens to these tori when the inte- 
grable modes they describe are subjected to many-body interaction in near- 
integrable motion? The flows remain bound to an invariant Hamiltonian 
surface. But other invariants which sustain the tori, such as the angu- 
lar momentum and eccentrum of two-body motion, need no longer exist 
when the motion becomes non-integrable. It might be thought that the or- 
dered integrable motion therefore disintegrates: that all the invariant tori 
are destroyed and the flows, no longer confined to tori, wander through- 
out phase space. Remarkably, this is uot the case, as one of Russia’s most 
distinguished mathematicians, A. N. Kolmogorov first proposed in the mid- 
twentieth century. His work, completed by his student V. I. Arnold and, 
independently, by Jurgen Moser, describes the way in which near-integrable 
motion is a rich composite of both order in deformed but unbroken tori and 
chaos in tori which have been broken. The theory Kolmogorov inspired, a 
graced moment in the history of celestial mechanics, is known as the KAM 
theory in honor of its three discoverers.) 

There are infinitely many rational numbers that satisfy the resonance 
condition (7.3). This is the situation for the first-order perturbation of 
the integrable motion that leads to (7.3). The tori corresponding to these 
infinitely many modes can be expected to be broken. But when a more com- 
prehensive perturbation procedure) is applied, ratios of natural frequencies 
are perturbed from rational numbers to real numbers which contain both 
the sets of rational and irrational numbers. The sets of rational and ir- 
rational numbers both contain infinitely many members. But the set of 
irrationals is “yet infinitely greater” than the set of rationals. The simplest 
way to grasp this fact of number theory is to observe that the irrational 



* See Eq. (4.69) and the ensuing discussion. 

f For a more extended discussion of the KAM theory, see Chapter 5 of Florin Diacu 
and Philip Holmes, Celestial Encounters: The Origins of Chaos and Stability, Princeton: 
Princeton University Press, 1996. 

J See “Secular Perturbation Theory” in Chapter 2 of A. J. Lichtenberg and M. A. 
Lieberman, Regular and Chaotic Dynamics, New York: Springer- Verlag, 1992. 
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numbers surrounding any rational number are so dense that if one selects 
any real number at random, the probability that that number is rational 
is zero. The predominance of irrational numbers in forming an ’’infinitely 
greater set” than the rationals was Kolmogorov’s key insight in seeing that 
most of the perturbed tori, being irrational in frequency ratios, will not be 
destroyed. He and his coworkers were able to prove that all the resonant 
and some of the nonresonant tori will be destroyed by a non-integrable 
perturbation. This is a small set in comparison with the nonresonant tori 
which survive. 

Many-body motion is therefore not generally pure chaos. It is, in the 
vicinity of a near-integrable motion, a mixture of both regular and chaotic 
motion. Both nonresonant tori and broken tori are commingled. But for 
large perturbations corresponding to many-body regimes far from near- 
integrable motion, the motion can become fully chaotic. 

Let us now examine what becomes of the motion in the vicinity of reso- 
nant tori when they break. Resonant tori disintegrate into sets of periodic 
resonant points. The set of resonant points entwines stable cycles (called 
elliptic points) and saddle points (called hyperbolic points). A cycle is 
a stable phase orbit, like the archetypal oscillation of a Kepler or Hooke 
mode, shown in Fig, 7-1 (a). A saddle is an equilibrium point consisting of 
both stable and unstable phase orbits shown in Fig. 7-1 (b). 





Figure 7-1. Stable Cycles and Mixed Stability Saddles. The cycle (a) is a stable equilib- 
rium point of bound two-body motion. The saddle (b) occurs in many-body motion. It 
consists of both stable and unstable sets of trajectories. Two trajectories approach, two 
leave, and all others skirt past. The unique trajectory of the approaching trajectory is 
called the separatrix because it separates trajectories with distinctly different futures 



Two exceptional curves intersect a saddle point, one consisting of ap- 
proaches to the equilibrium point, the other of retreat. All other trajec- 
tories skirt past. The curve containing the approaches to the equilibrium 
point is called a separatrix because it separates trajectories with distinctly 
different futures. Two points, V and Q , initially close to one another on 
skirting trajectories, but on opposite sides of the separatrix, will be swept 
into completely different regions of the flow after passing the saddle. More- 
over, the velocities of the particles on skirting trajectories slow to near zero 
as they approach the saddle. These particles spend large amounts of time 
in the vicinity of the hyperbolic point of the saddle. 
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Two different kinds of saddle points can exist as shown in Fig. 7-2. 
Heteroclinic orbits (a) consist of trajectories for which the approach and 
retreat curves are distinctly different trajectories. When approach and 
retreat lie on the same orbit (b), the equilbriium is said to be homoclinic. 
Homoclinic equilibria are seats of chaotic behavior. 




(a) heteroclinic orbit (b) homoclinic orbit 



Figure 1-2. Heteroclinic and Homoclinic Orbits. Heteroclinic orbits (a) consist of tra- 
jectories for which the approach and retreat curves are distinctly different orbits. For a 
homoclinic orbit (b), approach and retreat lie on the same orbit. 




(a) first approximation (a) second approximation 

Figure 7-3. The Homoclinic Tangle. A first approximation to a homoclinic orbit (a) 
reveals it is composed of many homoclinic intersections called primary homoclinic points. 
A second approximation (b) shows primary and secondary homoclinic points. 

A point Q on a homoclinic orbit is a member of both a stable and 
unstable set. It therefore must be an intersection point of trajectories from 
both sets. The existence of one lromoclinic intersection implies infinitely 
many other homoclinic intersections and the picture drawn in Fig. 7-2 (b) 
is but a rough schematic of the actual portrait in phase space. Better 
approximations are shown in Fig. 7-3; but the truth is, a lromoclinic orbit is 
an exceedingly complex structure best described as a “lromoclinic tangle.”* 



* For a more thorough discussion of homoclinic orbits and chaotic motion in general, see 
J. Guckenheimer and P.J. Holmes Nonlinear Oscillations, Dynamical Systems and Bifur- 
cations of Vector Fields. Applied Mathematical Science No. 42, New York, Heidelberg, 
Berlin: Springer Verlag, 1983. 
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A first approximation to the homoclinic tangle is shown in Fig. 7-3 (a). 
We see that the orbit is actually an interconnection of many lromoclinic 
points called primary homoclinic points. A second approximation is shown 
in (b) which reveals the trajectory (a) of primary intersections is actually 
composed of further intersections with itself called secondary homoclinic 
points. We also see that the slowing of particle motion near the saddle 
point allows the trajectories to be stretched out across phase space as the 
orbit draws closer and closer to the saddle point. The approximations 
can be continued and result in a complex web of “infinitely fine mesh” as 
Poincare described it. 

These saddle points spawn lromoclinic tangles and chaotic behavior; yet 
they are trapped between surviving invariant tori. In this manner, order 
and chaos simultaneously coexist in many-body motion. 



Chaotic Kepler Motion 

The chaos introduced by many-body motion may be illustrated with the 
two intimately related motions through which the two-body problem has 
led us, those of Kepler and Hooke. The Hamiltonian 

H = p 2 /2p-k/r + k P 2 /2, (7.4) 

where p is the projection of the position vector onto the plane X 3 = 0, 

2 2,22 2 

p = 2:1 + x 2 = r - x 3 , 

combines both motions. Aside from a constant term not included here, this 
is the Hamiltonian for the hydrogen atom in the presence of a magnetic 
field aligned with the polar axis x%. We are going to view this motion as a 
classical rather than quantum motion. The orbits of interest are therefore 
the outer orbits with quantum numbers n 1 for which the motion is 
approximated as a classical motion. 

In the limit in which the electron is free of the magnetic field and interacts 
only with the nucleus one has Kepler motion with the Hamiltonian 

TL = p 2 / 2p — k/r. 

This motion possesses a full set of isolating invariants for which the eccen- 
trum E completes the conservation law invariants Ti and M. The trajec- 
tory for bound motion is an ellipse perpendicular to the angular momen- 
tum vector whose center lies at a focus of the ellipse as shown in Fig. 7- 
4 (a). The fundamental frequency of the motion is the Kepler frequency 
ojk = \/8|7f| 3 /k 2 p given by Eq. (4.55). 
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In the limit in which the electron is free of the nucleus and interacts only 
with the magnetic field aligned with the polar axis X 3 the relative motion 
is that of the Hooke Hamiltonian 

U = p 2 / 2/z + k P 2 / 2 . 

The motion is that of a particle bound by the linear force but confined by 
the magnetic field to the plane perpendicular to the X 3 or polar axis. (It 
is a unique property of the magnetic force that it cannot act on a charged 
particle in its own direction; hence the potential energy term in the Hamil- 
tonian is independent of X 3 : it is expressed in terms of p 2 rather than r 2 .) 
The coefficient k depends upon the strength of the applied magnetic field. 
This motion also possesses a full set of isolating invariants with the Hooke 
invariant matrix A completing the conservation law invariants. The motion 
is integrable and the trajectory is an ellipse lying in the plane perpendic- 
ular to the polar axis whose center is the center of the ellipse as shown in 
Fig. 7-1 (b). The fundamental frequency of the uncoupled Hooke motion is 
uh = 

If the electron is now considered to be the hydrogen atom electron, the 
Hamiltonian describes both interaction with the nucleus and interaction 
with the magnetic field and is given by Eq. (7.4). If the motion were 
strictly two-body, the full position vector r 2 would appear in the place of 
p 2 and the combined Kepler-Hooke Hamiltonian would be separable. The 
confinement of the Hooke term to the plane is the effect of the magnetic 
field — all those many-body electrons in the magnet circuits interacting with 
the hydrogen atom electron. 

Another way of describing this situation is that the Kepler inverse-square 
force potential 1/r possesses complete rotational symmetry about all three 
axes but the Hooke potential p 2 = r 2 — X 3 2 possesses rotational symmetry 
only about the X 3 axis. As a result, the Hamiltonian (7.4) is not integrable; 
it is, at root, a many-body Hamiltonian. 

Even though there does not exist a coordinate system in which the mo- 
tion is separable, the separability of Kepler motion in parabolic coordinates 
and the presence of the projection p 2 in the Hooke term, a naturally oc- 
curring quantity in these coordinates, makes the parabolic coordinates £, 
ip </> the most revealing coordinates in which to view it. They are defined 
in Chapter 4 by 

xi = p cos (f > , X 2 = p sin (j > , 

X 3 = (£-??)/ 2 , r = (£ + ? 7 )/ 2 , 

with the projection p given by 

p =r - X3 = frj- 

From Eq. (4.38), the Hamiltonian (7.4) has the parabolic coordinate form 
(&>f/ 2p + rjp 2 / 2p) + ^-p 2 / 2/i - (7.5) 



n = 
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If one chooses the Kepler ellipse semi-major axis a as length scale and 
the inverse Kepler frequency ojk -1 = T / 2n as time scale, the energy TL = E 
takes the nondimensional value S = — The Hamiltonian (7.5) in nondi- 
mensional variables now scaled to these reference quantities takes the form 

H= ih (&, ' /2 + ^ /2) + ~ ih + \ in = - s- (7 - 6) 

The natural measure of the importance of Hooke motion compared to 
Kepler motion is the ratio of the Hooke and Kepler frequencies, 

e = y/k 2 n/8\TL\ 3 = u> H /u K , 

and is the natural measure of perturbation strength. The ratio of the 
potential energy in Hooke motion to that in the Kepler motion is e 2 . For 
e — > 0, the motion is pure Kepler. 

The phase space of the Hamiltonian (7.5)-(7.6) is six-dimensional; but 
the motion possesses two invariants, the energy TL and the polar angular 
momentum p ^ = M 3 . (The invariance of the remaining angular momentum 
components and the eccentricity e are destroyed by the presence of the 
magnetic field.) This means that a portion of the motion is separable — the 
polar mode corresponding to the invariant M 3 and it allows a reduction 
in the order of the Hamiltonian system of two (Note 6). This integrable 
mode has the polar action and angle variable a </, = </>. The action is the 
invariant = M 3 and the phase velocity is = dTL/dJ $ = Ms/f 'r] so 
that the separable mode is described by 

= (M 3 /£r))t + const , J $ = M 3 . (7.7) 

The remaining nonseparable modes constitute the reduced phase space 
which is four-dimensional. The remaining invariant, the energy TL, defines 
a three-dimensional surface in this space upon which the trajectories must 
wind. The known invariants therefore reduce the manifold of the flow from 
the full phase space to a three-dimensional space. This space turns out to 
be a three-sphere (Note 6). 

The integrable mode may be removed from the Hamiltonian by choosing 
coordinates aligned with the natural directions of Kepler motion, the three 
orthogonal directions M, e, and Mxe illustrated in Fig. 4-11. In spherical- 
polar coordinates it is appropriate to align M with the polar axis. In 
parabolic coordinates it is appropriate to align e with the polar axis in 
which case M 3 = 0 and the Hamiltonian becomes 

n = ^ (f P y 2 + VP y 2 ) -^- + ^ v = ~ (7.8) 

With M 3 = 0, the electron cannot orbit the x 3 axis. The polar mode phase 
velocity in Eq. (7.7) correspondingly vanishes and the motion lies in the 
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plane <f> = const. The case of pure Kepler motion (e = 0) is that of an 
ellipse lying in this plane parallel to the X 3 axis as shown in Fig. 7-4 (a). 
(This is in contrast to the orientation of the Kepler ellipse in the plane 
perpendicular to X 3 used throughout Chaps. 4 and 5 in spherical-polar 
coordinates because one is now dealing with parabolic coordinates in which 
e rather than M is aligned with X 3 and hence M 3 = 0.) 



x 3 X 3 





x 3 X 3 




Figure 7- 4 [. Combined Kepler and Hooke Motion, (a) Pure Kepler motion consists of 
rotations (e < 1 ) lying in a plane parallel to the polar axis and vibrations (e = 1 ) along 
the polar axis, (b) Pure Hooke motion consists of a Hooke ellipse perpendicular to the 
polar axis, (c) For M 3 = 0 the Hooke ellipse degenerates to a straight-line vibration 
perpendicular to the Kepler ellipse and the combined motion (d) takes place in the plane 
</> = const. 



The Kepler ellipse is a circle when e = 0 and collapses into a straight-line 
vibration along the X 3 axis whenever the angular momentum M completely 
vanishes. ( M is no longer an invariant so this can occur in the course of 
the motion.) As was shown in Chapter 4, the vibratory motion to which 
the Kepler ellipse degenerates when the angular momentum vanishes corre- 
sponds to e = 1. This is a collisional trajectory and it is distinct from the 
rotational Kepler motion because it passes through the origin. The Kepler 
motion therefore consists of rotations (e < 1) and vibrations (e = 1). 

Pure Hooke motion consists of an ellipse lying in the plane perpendic- 
ular to the plane of Kepler motion shown in Fig. 7-1 (b). For M 3 = 0, 
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the rotational modes are prohibited and the Hooke motion consists only 
of the straight-line vibration perpendicular to the polar axis (and to the 
Kepler vibration) as indicated in Fig. 7-1 (c). The uncoupled motions for 
M 3 = 0 are Kepler rotations and mutually perpendicular Kepler and Hooke 
vibrations. 

The combined motion takes place in the plane 4> = const as indicated by 
Eq. (7.7). The motion is a competition between rotational and vibratory 
modes. The hydrogen nucleus attempts to draw the electron into a Kepler 
rotation in a plane parallel to the polar axis or a vibration along the polar 
axis; the magnetic held attempts to draw it into a Hooke vibration per- 
pendicular to this axis. The magnetic force acting on the electron on the 
Kepler ellipse is a maximum when the orbit is circular (e = 0) and vanishes 
for the Kepler vibration (e = 1). 

The Hamiltonian acquires an even simpler form if, instead of £ and 77 , 
squares are used as coordinates (Note 6): 



a 2 = f= (r + x 3 ), 



t 2 = rj = (r - x 3 ). 



(7.9) 



The corresponding momenta are 

Pa = dL/da = 2 y/lp^ 
and Eq. (7.8) reduces to 
1 



p T = dL/df = 2 ^/ 77 p v , 



n = 



<7 2 + T‘ 



{pl/2 + p 2 T /2)- 



a 2 + t 2 



—a r = 

2 2 



(7.10) 



The Hamiltonian (7.10) is not separable because of the term in e 2 and 
the trajectories must be obtained by numerical integration of the Hamilton 
equations. 

Poincare found that motion could be more simply grasped by directing 
one’s attention to a specially selected set of points of a trajectory rather 
than to the trajectory as a whole. These points are created by the trajectory 
on a plane of observation which Poincare termed a surface of section shown 
in Fig. 7-5. This is a surface in phase space which lies transverse to the 
flow. Each time the trajectory pierces the surface of section it generates a 
point. A bound particle, like a planet in the solar system or the electron of 
a hydrogen atom, pierces the surface of section every time it completes an 
orbit. 

Poincare showed each completion of an orbit maps a point in the surface 
of the section to a new point. In the case of the bound, two-body problem, 
the new point will, of course, map to the same position as the previous 
point. But for more general motions — of which the hydrogen atom in a 
magnetic field is one — the mapped points will be different. So, in general, 
curves will be created in the surface of section by the piercing points of 
an orbiting body. Poincare initiated an entire theory of the behavior of 
mapped points in the surface of section that is the basis for understanding 
chaos in dynamical systems. 
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Figure 7-5. Poincare Surface of Section. The surface of section is a surface in phase 
space which lies transversely to the trajectory. It is intersected by successive crossings 
of the complete trajectory. 

For coupled Hooke and Kepler motion, an appropriate surface of section 
is a plane perpendicular to the flow defined by a conjugate coordinate pair 
such as the ( r,p T ) plane at er = 0. This surface of section is shown in 
Fig. 7-6. The circular boundary corresponds to the limiting curve p a = 0 
in the surface a = 0 from Eq. (7.10): 

T 2 +p 2 T = 4. 

Flows which degenerate to a point in the Poincare section are important 
features of a motion and are known as the fixed points of the flow. Five fixed 
points lie in the surface of section of the hydrogen atom in a magnetic field. 
The center point (0,0) corresponds to the Kepler vibration (one may think 
of the electron vibrating along a line perpendicular to the page). The points 
(±y/2, 0) correspond to the rotational Kepler orbits (the electron emerges 
from one point and re-enters the section at the opposite point) . The points 
(0, i-^/2) correspond to the Hooke vibration (each is the turning point of 
the e = 1 collapsed Hooke ellipse). 

The integrable motion about which the perturbation of the combined 
motion is made is a pure Kepler ground state — the circle corresponding to 
the rotational fixed points on the horizontal axis of Fig. 7-6. The combined 
motion may be determined by numerically computing the trajectories from 
the Hamilton equations for the Hamiltonian (7.10). These trajectories, 
captured in the Poincare section, appear in Fig. 7-7. 

Once the perturbation, however small, is made, a significant shift in the 
motion occurs. This can be seen in Fig. 7-7 (a). The surface of section of 
the unperturbed motion consists of only the two rotational fixed points on 
the horizontal axis; but under even the smallest perturbation, the trajec- 
tories escape from these fixed points and range over the entire surface of 
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Figure 7-6. Surface of Section of the Hydrogen Atom in a Magnetic Field. The points 
are exhibited in the (r,p T ) plane at o = 0. The center point (0,0) corresponds to 
the straight-line Kepler vibration for which e = 1 (the electron vibrates along a line 
perpendicular to the page). The points ( ± 0) correspond to the rotational Kepler 

orbits (the electron emerges from one point and re-enters the section at the opposite 
point). The points (0, ±y/2) correspond to the vibratory Hooke motion (each is the 
turning point of the collapsed e = 1 Hooke ellipse). 



section. With the invariance of the eccentricity broken, the orbiting elec- 
tron is free to explore all eccentricity values ranging from rotational modes 
close to its original circle e « 0 to the vibratory mode e = 1. A small per- 
turbation introduced by the magnetic field e«l does not lead to a small 
perturbation in the eccentricity e. This global shift of the motion under a 
small perturbation is a result of the complete degeneracy of the hydrogen 
atom. 

A perturbed trajectory no longer continually pierces the surface of sec- 
tion in isolated fixed points; instead the orbit ticks off a multiple point pat- 
tern like that illustrated in Fig. 7-7. This pattern entwines three kinds of 
point sets: a continuous closed curve (including the vibratory fixed points 
for which the curve has degenerated to a point), a chain of closed loops 
called islands which are continuously revisited, and a chaotic spattering of 
points which are never revisited. 

A scurve in the surface of section corresponds to an open trajectory. This 
is because a closed orbit in phase space cannot create a closed curve in the 
Poincare section. It pierces the surface of section in a countable number of 
disconnected points. But an open orbit in phase space can create a closed 
curve because it pierces the Poincare section in an infinite number of points 
densely filling out the curve. Such curves are known as invariant curves. 
They can be seen in Fig. 7-7 (a) for which e = 0.19, a perturbation strength 
still close to the integrable Kepler motion. Each invariant curve in Fig. 7-7 
(a) corresponds to a particular trajectory generated by a particular set of 
initial conditions, all of which correspond to the same value of e. 
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(e) (f) 

Figure 7-7. Surface of Section [the (r,p T ) Plane] for Chaotic Kepler Motion. In (a) 
the perturbation strength is e = 0.19. Invariant curves encircle the primary resonances 
shown in Fig. 7-6. In (b) (e = 0.22), chaotic trajectories indicated by the spattered points 
and originating near the the separat rices have appeared. They become more numerous 
in (c) (e = 0.24) and (d) (e = 0.26) for which island chains are also in evidence. Invariant 
curves are progressively destroyed as perturbation strength increases to e = 0.30 in (e) 
and finally completely engulfed for e = 0.38 resulting in ergodic motion (f). [Courtesy 
H. Freidrich and D. Wintgen, Physics Reports 183, No. 2, 37-79, 1989] 
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The fixed points of the motion are resonant points because they corre- 
spond to closed trajectories. The two rotational resonances (i-^/2, 0) of the 
circular Kepler orbit in Fig. 7-7 (a) are intersected by invariant curves. The 
vibratory Kepler resonance which appears as the single point (0, 0) and the 
vibratory Hooke resonance which appears as the pair of points (0, ±^2) 
are encircled by invariant curves. The vibratory fixed points are stable to 
small perturbations in the sense that orbits close to them when perturbed 
maintain the same structure. 

The two invariant curves which intersect the rotational fixed points 
(±^2, 0) separate the vibratory modes from the rotational modes and are 
the separatrices of the motion. The rotational fixed points are unstable. 
Orbits near them are destroyed and different kinds of trajectories come into 
being once the perturbation is made. 

The unstable fixed points and the separatrices emanating from them 
are the seats of lromoclinic tangles and chaotic behavior; it is along the 
separatrices that chaos first develops as can be seen in Fig. 7-7 (b). These 
are indicated by both the island chains and the spattering of isolated points 
which have appeared in the place of the separatrices. Spattered points, 
unlike the points which make up the invariant curves, cannot be ordered. 
They do not define a curve. It is this lack of order which makes such 
trajectories chaotic. 

The different island chains correspond to different combinations of m\ 
and m 2 which closely satisfy the resonance condition (7.3). A chain of small 
islands surrounding its central resonant points corresponds to the same 
trajectory. The successive crossings of this trajectory jump from island to 
island, beginning at one island, moving successively over this island chain, 
and returning to the first where the sequence repeats. 

Chaotic trajectories are the outcome of resonances and their unstable 
separatrices. Many-body resonances are pervasive because a rational num- 
ber exists arbitrarily close to any irrational number. Resonant motions are 
therefore densely entwined with nonresonant motions in near-integrable 
motion. Thin layers of chaotic points commingled with island chains are 
actually distributed throughout the surface of section for all levels of per- 
turbation strength. These resonance layers and their island chains are very 
thin for small perturbations and disappear into the invariant curves on the 
scale of the pictorial accuracy of the figure. (Imagine looking at an invari- 
ant curve through a microscope and discovering that it is not constructed 
from a solid line but rather from infinitely fibered braids of finer and finer 
island chains.) For larger perturbation strengths, the island chains and the 
chaotic points are clearly visible in Figs. 7-7 (c)-(e). 

The stable vibratory fixed points are well isolated from the chaotic re- 
gions by their invariant curves for small perturbation strengths. The chaotic 
layers are isolated from one another and chaotic trajectories in one layer 
cannot intersect those in another. 

As the perturbation strength is increased, chaos invades more and more 
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of the phase space by destroying the invariant curves separating the islands 
as shown in Fig. 7-7 (d) and (e). The chaos progressively encroaches 
upon the stable fixed points and finally engulfs them as in Fig. 7-7 (f). 
When the last invariant curve is destroyed the chaos becomes global, filling 
the entire phase space. Motions whose flows fill entire regions of phase 
space are said to be ergodic. The trajectories which began as the fully 
degenerate trajectories of the unperturbed Kepler orbit filling a manifold 
of one dimension now fill the nonintegrable phase space of three dimensions. 



Chaos and the Quantum 

Classical many-body motion is inherently open to chaos as illustrated by 
the outer quasi-classical orbits of the hydrogen atom in a magnetic field. 
The complexity of chaotic motion and the smearing of the particle flow over 
the phase space gives chaotic motions a random or stochastic character. 
Quantum motion also has a stochastic character; but it originates in a 
distinctly different source in the quantum uncertainty principle. 

It might be expected that quantum motions with their inherent uncer- 
tainty and many-body motions with their inherent chaos when combined 
would produce intense stochastic behavior. In fact, the opposite is true. 
Quantum effects suppress classical chaos. This is because chaos feeds on 
phase space filling the largest scales down to the finest scales. Quantum 
effects limit the otherwise classically limitless fineness of phase space which 
chaotic orbits can enter. 

One can think of the uncertainty principle as a quantum symmetry. It 
manifests itself in the destructive and constructive interference of quantum 
waves and structures the flow in volumes of phase space on scales fixed 
by the Planck constant. Since phase space is built up of canonical pairs 
( qi,Pi ) whose product represents an action, those regions of phase space on 
scales Ag.j, A pi whose volume is V « ]”[■ Aq jAp* rts Ti s will be governed by 
quantum rather than classical mechanics. 

The interference of the waves in the quantum regions blocks some orbits 
and reinforces others thereby reimposing order upon chaotic motion which 
has broken the constraints of its classical symmetries. This can only occur 
for the chaotic motions which have reached the quantum level. The larger 
scales of phase space are free of the uncertainty principle and the chaos 
operating on these scales is not suppressed by the quantum. This effect 
occurs in the hydrogen atom in a magnetic held. The outer orbits which are 
quasi-classical are chaotic. As one progresses inward to the lower orbits, the 
phase how progressively becomes a held of quantum waves which suppress 
the chaotic how. For the hydrogen atom this is the scale of the lowest lying 
orbits. 




232 



The Shaggy Steed of Physics 

Cosmos and Chaos 



The hydrogen atom in a magnetic field illustrates the manner in which 
many-body motions evolve chaotically. Chaos manifests itself in the atom 
and the elements. Chaos also manifests itself in the heavens. The solar 
system is our cosmos, its slow evolution still strongly imprinted by the 
initial conditions of its formation. Yet it too is open to chaotic behavior. 
The motion of the planet Pluto appears to be chaotic on a time scale 
of tens of millions of years. The large asteroid belt in the vicinity of the 
inner planets is rich in chaotic behavior generating particle orbits with large 
eccentricities which send meteorite material into the earth and mars. 

Since resonances are the seat of chaotic motion, it is chaos which is the 
generator of the regular rings of Saturn and the gaps in the asteroid belts. 
In an interesting twist, the “regular” rings are the former abodes of chaotic 
matter. Chaos sculpts regularity in a dynamical has relief in the following 
way. 

Orbiting material in each planetary ring or asteroid belt has a unique 
orbital frequency associated with the ring or belt. The motion of this ma- 
terial is perturbed by the giant planets and resonances develop for material 
whose orbital frequency is a rational multiple of the orbital frequency of 
the planet. As we saw in the case of the hydrogen atom in a magnetic field, 
small perturbations in energy can nonetheless lead to large perturbations in 
the eccentricity. Rings regularly ordered by rational numbers exist because 
chaos generated by the resonant motion of the bodies in that ring with 
the orbital frequency of the planet sends them off on wildly eccentric orbits 
which collide with other planets thereby removing them and delineating the 
ring or belt. The adjacent ring or belt containing matter is nonresonant.* 

The motion of two celestial bodies is the exemplary cosmic motion. Its 
orderly flow unfolds from and is completely specified by the initial state. 
In the eighteenth and nineteenth centuries this fact was extended to an 
even more sweeping observation based upon the classical equations of mo- 
tion. The Marquis de Laplace proclaimed that given the initial state of all 
the particles of the universe, the equations of motion determine the future 
course of the world. Everything to be is contained in the initial state of 
mechanics — and indeed, by the same Laplacian argument — in the present 
state; but it is only visible to an omniscient being. 

The creation was understood as a great mechanical system, a complex set 
of interacting trajectories projected from their initial conditions and bound 
by the principle of least action. The Creator set up the law of motion and 
the initial conditions and then abandoned the machine to run along on its 
own. 

The uncertainty of quantum motions overthrew this deterministic picture 
of the physical world. Quantum mechanics tells us that any initial condition 



* Wisdom, J., “Chaotic Dynamics of the Solar System,” ICARUS, 72, 241 275, 1987. 
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cannot be specified exactly. Motion evolving from those initial conditions 
is not fully contained within the the initial state in the sense of Laplace. 

Quantum motions are indeterministic in principle because the initial 
state cannot be endowed with all the information required to fully specify 
the motion. Classical many-body motions are deterministic in principle; 
but this determinism leads to chaos. The notions of cosmos and chaos are 
therefore as important in describing the evolution of the classical equations 
of motion as that of their determinism, a distinction unknown to Laplace. 

A cosmic evolution of deterministic equations, such as that governing two 
heavenly bodies, fits appropriately with Laplace’s vision. A deterministic 
mechanics which evolves chaos does not. Classical chaotic evolution, like 
quantum uncertainty, undermines the imperial role of initial conditions. It 
does so because of the sensitive dependence of chaotic evolution to initial 
conditions. 

The role of chaos in nature may be illuminated, not by focusing on the 
connection between a chaotic motion and its initial state, but rather on 
the large-scale connections between different chaotic motions. From this 
perspective, chaos is not formless. It is the building block of form and 
structure on new scales. This means that many-body motion perceived 
from the local scale of distances separating the bodies in any near-integrable 
motion is chaotic; but on the global scale of an ensemble of chaotic motions 
as a whole, the combined motion of all bodies possesses a coherent form 
and structure. Cosmos gives way to chaos out of which new cosmos arises. 



Metamorphosis 

In the limit in which myriad atoms and molecules interact together, in- 
dividual chaotic motions of the particles coalesce to form a continuum — a 
solid, liquid, or gaseous medium. Individual molecular motions lose their 
significance and the motion of the continuum as a whole becomes the dy- 
namically interesting object. Individual microscopic particle motions are 
chaotic; but, remarkably, the motion of the continuum as a whole need not 
be chaotic. 

The continuum of solids, liquids, and gases shows that the integrable 
mechanics exemplified by two-body motion is reborn from chaos — a rebirth 
as revealing as the apparition of Prince Conn’s companion from the remains 
of the shaggy steed. This metamorphosis is revealed in the destruction of 
cosmic two-body motion in a continuum and its replacement by the chaotic 
many-body motion of atoms or molecules within a rigid body. The rigid 
body is a continuum of internal molecular motion, often chaotic, which as 
a whole is cosmic: the motion of a rigid body is integrable. 

Because of its rigidity, the positions of all points within a rigid body are 
fixed with respect to a reference frame (x'^, x' 2 , x' 3 ) embedded within it as 
shown in Fig. 7-8. Their motion is therefore completely determined by the 
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Figure 7-8. The Rigid Body. The motion of every point within a rigid body is completely 
determined by the motion of the reference frame x' embedded in it. The relative motion 
of this reference frame possesses only three degrees of freedom such as the three angles 
<f>(t), V'(t) . 0(1.) (known as Euler angles). The motion of these angles therefore fixes the 
relative motion of every point within the body. 

motion of this reference frame. The specification of the motion of the three 
angles which fix the orientation of this reference frame with respect to the 
center-of-mass frame (x\,X2, £3) is sufficient to describe the relative motion 
of all the points in the body. The relative motion of a rigid body therefore 
has only three degrees of freedom and two Casimir sets of three invariants 
each, 

just like the vector representation of two bodies. 

One sees immediately that generic rigid body motion is integrable since 
its Casimir sets contain three invariants which match the three degrees of 
freedom of the relative motion. Moreover, since there is more than one 
independent Casimir set, rigid body motion is degenerate. But since it 
does not possess three independent sets, it is not fully degenerate unless 
additional symmetries are imposed. The phase flow of a generic rigid body 
is open and densely covers its torus. 

The conservation laws are the only deep laws of motion for the contin- 
uum. Since there are many more degrees of freedom than there are conser- 
vation laws, the internal motions of the particles seemingly defy description. 
But there is a saving feature in the many-body limit of the continuum — 
the ergodic behavior of the chaotic regime: individual particles are free to 
explore the entire phase space of the motion subject only to the constraint 
of the conservation laws. As a result, statistical methods of description and 
analysis of the internal motions becomes feasible. 



7 . The Manifold Universe 



235 




Figure 7-9. Ordered Motion Built upon Molecular Chaos. The vortical motion of a smoke 
ring. The internal motion of the individual atoms and molecules of the air is chaotic; but 
the macroscopic motion which the air as a whole undergoes is highly ordered. [Courtesy 
R. H. Magarvey and C. S. Maclatchy, Canadian J. Phys. 42, 678-683, 1964.] 
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One no longer speaks of individual particle motions in the continuum. 
One speaks instead of the mechanics of the continuum. The mechanical 
quantities inherent in the conservation laws (linear momentum, angular 
momentum, energy) also describe the continuum because the global con- 
servation laws remain universally valid. 

The global conservation laws are in themselves incomplete to describe the 
continuum. New mechanical concepts unique to the continuum and which 
have no significance for an individual molecule come into being. These 
include temperature, pressure, stress, heat, and entropy. These macroscopic 
continuum quantities, rather than the individual microscopic trajectories 
of particles, more properly describe the internal many-body motion of the 
continuum. 

The rigid body is the leading illustration of the new level of cosmos which 
arises at the macroscopic scale of the continuum underlain by microscopic 
chaos. The rigid body is always a cosmic motion. But on the new level 
of structure and dynamics of the continuum, deformable solids as well as 
liquids and gases also acquire a new cosmic simplicity and new levels of 
chaos. 

Another example of the interplay of cosmos and chaos is offered by the 
ultimate many-body medium: the fluid. In a fluid composed of a continuum 
of atoms and molecules, the new cosmos which arises can be ordered. One of 
the most basic structures of the fluid continuum is the vortex: a smoke ring 
shown in Fig. 7-9. Although the internal motion of individual molecules of 
the air and smoke are chaotic, the ensemble motion is ordered into tightly 
bound spirals. 

But the cosmos of the continuum has its own life-cycle. It too dissolves 
into chaos. The motion of the fluid smoke ring is highly ordered as it first 
emerges; but it is a vulnerable order which will grow unstable and break 
up into a new kind of chaos proper to the macroscopic level: turbulent 
motion in which the fluid becomes highly disorganized. One encounters 
a new disintegration of the cosmos of the continuum into the chaos of 
turbulence. The appearance of chaotic orbits in many-body motion on 
the molecular level is recapitulated on the macroscopic level where ordered 
ensemble motion gives way to chaotic motion on the macroscopic level. 

Nature surrounds us with ascending and descending rounds of chaos and 
cosmos. Cosmic two-body motion on the molecular level gives way to the 
chaos of many-body motion which in turn gives way to the cosmos of the 
continuum. But the continuum is open to instabilities which destroy its 
order and once again re-open the way to the chaos of turbulence.* 



* An uncertain quantum fluctuation in one of the electrons of a molecule of air at 
the earth’s surface will disturb the trajectory of that molecule after about one hundred 
collisions. It takes approximately 1 min for such a microscopic fluctuation to reach 
the millimeter turbulence scale of earth’s atmosphere, about one day to grow to the 
kilometer scale, and about one week to affect the global climate. (The global weather 
is not the result of a single fluctuation, quantum or otherwise. Abundant unstable 
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The round continues to higher levels of scale. The earth and sun with 
their turbulent oceans and atmospheres are mere mass points on the scale 
of the entire solar system. On solar system scales these two point masses 
approximately execute two-body, cosmic motion. But on galactic scales 
embracing myriad stars, the universe behaves like a continuous, gravitating 
fluid in which earth and sun are chaotic, many-body participants, much 
like atoms or molecules in a gas or liquid. 

Perhaps the most remarkable aspect of the natural world is the man- 
ner in which each level of cosmos is interleaved with other levels so as to 
develop its own integrity and largely follow its own course. The disparate 
scales of the elementary forces establish the basic levels of structure of the 
universe. For example, a single water molecule consists of nuclei of protons 
and neutrons binding a constellation of orbiting electrons. The dynamics 
of the electrons are not strongly coupled to the internal nuclear states of 
the protons and neutrons which make up the hydrogen and oxygen nuclei 
to which they are bound. The electronic states have organization differ- 
ent from the organization of the nucleus. The electrical force establishes 
a unique scale of structure from that of the nuclear forces. But if only 
the nuclear, gravitational, and electrical forces differentiated matter in the 
universe, the world would be a monotonous one indeed. 

Might it not be that the transitions from regular to chaotic motion are 
the generators of the complex textures of the physical world? The chaotic 
possibilities inherent in Hamiltonian many-body mechanics provide for a 
manifold universe. Chaos makes determinism perishable. If it did not 
do so, large aggregations of microscopic constituents would simply be big 
versions of the same microscopic structure. 

Water again illustrates the point. A many-body ensemble of water 
molecules possesses a variety of forms. Solid water manifests itself as crys- 
talline ice. The crystal lattice is immobile: it is a highly cosmic arrangement 
in which the nuclei of the water molecules are locked in fixed positions. f 
The ensemble is a large-scale replication of its constituent water molecules. 
The patterns of snowflakes directly exhibit the bond angles and symme- 
tries of the underlying angularly shaped water molecules of which they are 
composed. 

In the liquid phase the lattice is destroyed. The water molecules break 
free of their confinement in the lattice, melting into a chaotic, many-body 
motion: the solid becomes a liquid droplet. In contrast to the ordered ar- 
rangement in the immobile solid the molecular many-body motion within 



microscopic fluctuations are growing, competing with one another, and merging into 
larger structures. Some will ultimately grow to the scale of the global atmosphere; but 
they will do so at the expense of others which will ultimately be suppressed.) 

f It is interesting to note that it is the quantum nature of matter that makes liquids 
and the ordered lattice of the solid phase possible. The classical many-body motion 
of point particles does not possess any mechanism for phase changes. Molecular bonds 
creating liquid and solid phases are inherently quantum-mechanical. 
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the liquid droplet is chaotic. The overall structure of a water droplet is not a 
large replica of the water molecules chaotically jostling about within it. The 
droplet has shape, structure, and mechanical organization distinct and dra- 
matically different from that of the individual water molecules which make 
it up. This sharp difference between the liquid droplet and its molecules (in 
contrast to the strong similarities between the ice crystal and its molecules) 
is a result of the chaotic motion of the molecules in the liquid phase. 

The earth is not a big molecule. It is a decidedly different structure built 
from molecules. Chaos limits the mechanical organization which resides in 
the delicate determinism of one level while a new mechanical organization 
and an equally perishable determinism coexists at another level. 

Continue to yet higher levels of the universe. The wind currents of 
a thunderstorm containing massive numbers of water droplets do not turn 
sensitively on the motion of a single water droplet within the storm’s down- 
draft. The structure and motion of a thunderstorm globally shaped by its 
massive anvil cloud are uniquely different from the structure and motion of 
an individual water droplet flying about in the turbulent cascade within it. 
Yet the storm itself, arising out of the moist unstable atmosphere owes its 
existence to the water vapor which has formed these droplets. 

The global motion of the earth’s atmosphere is not the result of a sin- 
gle thundershower. The atmosphere is rich in fluid structures — cyclones, 
frontal systems, jet streams, boundary layers, cumulus towers — of which a 
thunderstorm is only one among many. Chaos provides the means of differ- 
entiation through which each uniquely different fluid structure acquires its 
own identity. Operating on the microscopic motions of atoms and molecules 
bound by electrical forces up through macroscopic turbulence driven by 
gravity and the heat of the sun, the chaos inherent in Hamiltonian mechan- 
ics limits the determinism of each level thereby delineating one structure 
from another. 

Continue beyond the planetary level. The spiral motions of galaxies are 
coupled in only the most tenuous way to the motions of a single star let 
alone to the atmosphere of one of the planets of that star. Yet all these 
levels are interleaved with one another and each is the stuff of its underlying 
constituents from the largest galactic scale down to the protons, neutrons, 
and electrons of the atomic and nuclear level. 

The world sustains exceedingly complex motions such as those of the 
atmosphere, oceans, and galaxies. The general equations of motion of the 
continuum are not integrable (indeed, cannot be integrable if they are to 
faithfully describe the complexity of the world). But these equations do 
contain some surprisingly simple solutions — exact integrable solutions no 
less — which frequently appear in nature. (Nature contains structures of 
pure simplicity as well as tangled complexity!) 

One class of these integrable motions are those for solitary waves — or 
solitons as they have come to be called. These are exact wave- like solutions 
of the fluid equations which consist of a solitary wave which propagates 
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without change of shape. The first soliton was worked out by Lord Rayleigh 
in 1875 to describe a dramatic solitary wave observed in an English canal 
by the engineer and naval architect Scott Russell. Russell chased the wave 
on horseback for the better part of two miles until it escaped him. It was 
thirty feet long, a few feet in maximum height, and moved unchanged in 
shape at a speed of nearly ten miles per hour. Another example of a solitary 
wave embedded in a turbulent planetary atmosphere is the great red spot 
of Jupiter. 

The transformation of the cosmic motion of two-bodies and its appari- 
tion in the many-body motion of the continuum is an engaging round in 
the cycles of motion. Chaotic evolution severs mechanics from a rigid ca- 
pitulation to initial conditions. The products of the initial conditions in 
chaotic molecular motion are the raw materials for cosmos at a larger scale. 
Chaotic evolution supports the rich structure of the world in which new lev- 
els of order involving complex molecular structures including the molecules 
of life and living organisms come into being. 

Mechanical law is, without exception, validated as the necessary condi- 
tion of all motion; but in its inherently quantum and chaotic evolution, it 
introduces both uncertainty and complexity into mechanics. It governs the 
ground of motion while leaving open the kinds of structures which evolve 
and play upon that ground. We ourselves illustrate this fact. On the 
small scale of the organic molecules which make up the fluids and tissues 
of our bodies, motion is one of many-body chaos; but on the large scale of 
our tissues, organs, and nerves built from myriad molecules, we are highly 
structured creatures. The world sustains large-scale order and structure 
built upon an underlying classical chaos and quantum uncertainty. How it 
does so is an inviting mystery. 



Finale 

The Shaggy Steed of Physics reveals the unity and simplicity of the 
heavens and the elements while at the same time it confronts us with the 
inexhaustible complexity and mystery in which such simplicity resides. The 
universe seems to be embraced by ultimate cosmological limits: an origin 
in the Big Bang, expansion, and possibly recompression back to the origin 
from whence it began. Within this envelope there flows a bountiful creation 
with rich dynamics on a vast series of smaller scales. Mechanical law binds 
the elemental fabric of the universe giving it warp and weft. But the fabric 
seems otherwise free to flow and gather shape into the more complex struc- 
tures of creation from galaxies, stars, planets, and the geological structures 
up through the flow of being to complex living creatures. 

The many-body motion of the universe manifests a freedom from the 
tyranny of initial conditions while it is still very much a cosmos sustained 
by the law of motion. The creation seems to spring anew at each stage of 
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its complexity, appropriately incorporating its historical structure from the 
past but with open possibilities for the future. This evolution, the unfolding 
of the old and the knitting together of the new, is shrouded in a mystery 
in which the law of motion is the ground. But that which breathes life into 
the law of motion and each unfolding event transcends initial conditions. 
Mechanics is not the determinant of the universe but the vivified space and 
time of an ongoing creation. 




People 



The story of the physical world is told by people. It has a human face. Here 
is a collection of brief biographies of some of the principal figures in the story 
of the heavens and the elements. My selection is not meant to be complete; 
many of overshadowing importance are not represented — Copernicus and 
Bohr just to name two. I have chosen those with a particular connection 
to the law of motion and the Kepler problem and its topology and symme- 
tries. They are Europeans for the most part — but they represent the entire 
continent eastward to Siberia and westward to the British Isles and Ireland, 
northward to Scandanavia and south to Italy. They came from scholarly, 
clerical, merchant, or banking backgrounds with a notable exception: Isaac 
Newton was born into a farm family. 

Some, like Newton and Kepler, are household words. Others, like Heinz 
Hopf, are little known. Still others like Emmy Noether have only recently 
been celebrated. Jacob Hermann has lain in obscurity for two centuries; 
however the late twentieth century is discovering him.* 

The contributions for which some of the individuals are remembered 
were brief moments. Emmy Noether’s work on the symmetries of the vari- 
ational principal was a youthful discovery. Most of her life was devoted 
to the problems of abstract algebra. Wolfgang Pauli is more notably re- 
membered for the neutrino and the exclusion principle than his elegant use 
of the eccentricity invariant in the construction of the hydrogen spectrum. 
Jacobi’s canonical transformation theory is a key aspect of both classical 
and quantum mechanics; but he himself was most devoted to his theory 
of elliptic integrals. On the other hand, Mendeleev was passionately occu- 
pied with the Periodic Table for his entire life, Kepler was consumed with 
the problem of the planetary orbits, and P. A. M. Dirac’s creative energies 
rarely strayed from the central problems of quantum mechanics. 



* A number of works on Hermann fostered by the Bernoulli Edition, Basel are appear- 
ing. See F. Nagel, “A Catalog of the Works of Jacob Hermann (1678-1733),” Historia 
Ma thema tics, 18 (1991), 36-54 and “The Life and Work of Jacob Hermann” in the same 
issue. 
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Johannes Kepler 

1571 Weil der Stadt, Germany-1630 Regensburg, Germany 
(Courtesy AIP Emilio Segre Visual Archives) 

From an early age, Kepler was passionately dedicated to discovery of the geometry 
of the heavens. It was Kepler’s happy fate that his imaginative powers found their 
way to a close (if often strained) collaboration with Tycho Brahe, a skilled observer. 
Following Tycho’s death, Kepler was the recipient of the finest astronomical data of the 
time. The first fruit of Kepler’s fertile imagination was quite misguided. Kepler found 
that if one nests the classical solids — cube, tetrahedron, dodecahedron, icosahedron, and 
octahedron — the ratios of the spheres enclosed between them are fair approximations to 
the ratios of the orbits of the planets. While a brilliant tour de force , the discrepancies 
were not insignificant and the solids lacked dynamical significance; Kepler lost interest in 
the scheme. Nonetheless, it was this kind of inspired exploration based upon Copernican 
heliocentricity that ultimately led him to the ellipse and its area, and period law, 20 
years later. 
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Dmitry Ivanovich Mendeleev 

1834 Tobolsk, Siberia— 1907 St. Petersburg 

(Courtesy AIP Emilio Segre Visual Archives, W.F. Meggers Gallery of Nobel Laureates) 

Mendeleev was a devoted student of chemistry for his long and productive life with wide- 
ranging interests from theoretical chemistry to cheesemaking, mining, and petroleum. 
Yet, the periodic arrangement of the elements overshadowed all other interests. As Kepler 
sought the underlying unity of the heavens, Mendeleev sought the underlying unity of 
the elements. The crucial step came in an inspired moment in 1869 when, working on his 
treatise Osmovy khimii , it suddenly struck him that, contrary to the established theory of 
types, he should place metals — copper and silver — next to the alkalis. This led him to use 
atomic weight as a basis for ordering the elements. Mendeleev then made a deck of cards, 
each face an element, and began arranging them in various groups. This allowed him 
to recognize periodicity. Subsequent refinements revealed errors in established atomic 
weights and gaps that led to the prediction of elements not yet discovered. Although 
the Periodic Table was first resisted, the subsequent discovery of the predicted elements, 
particularly gallium and germanium, ushered in its total acceptance by 1880. 
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Isaac Newton 

164-3 Woolsthorpe Lincolnshire, England — 1727 London 
(Courtesy AIP Emilio Segre Visual Archives, W.F. Meggers Gallery of Nobel Laureates) 

A growing wave of mathematics and mechanics spurred by the Copernican revolution 
and the revival of learning in the renaissance reached an apotheosis in Isaac Newton, a 
child not of aristocratic or scholarly origins (as was the case of most mathematicians and 
scientists), but of yeomen. Many of Newton’s most important discoveries came about 
during a seminal two-year period in his early twenties when, having been exposed to the 
works of Euclid, Galileo, Kepler, Hooke, Descartes, Walis, and Schooten, and with Cam- 
bridge University closed because of plague, he was free to take up his own thoughts in 
seclusion. Newton’s abundant gifts produced the differential calculus, the foundations of 
classical mechanics, and epochal discoveries in optics, mathematical analysis, and geom- 
etry. The Kepler ellipse, the area law, and the period law all received their explanation 
in the Newtonian synthesis of the equations of motion and the inverse-square law of 
gravity. 
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Paul Adrien Maurice Dirac 
1902 Bristol, England — 1984 Miami, Florida 
(Photo by A Bortzells Tryckeri, courtesy AIP Emilio Segre Visual Archives) 

P. A. M. Dirac was a gifted mathematical inventor who saw how quantum mechan- 
ics rises from classical mechanics, yet transcends it. Dirac did not know of the Bohr 
atom when he arrived at Cambridge in 1923; yet he quickly began contributing to the 
mathematical structure demanded by quantum phenomena, discovering the connection 
between the Poisson bracket and the commutator of Heisenberg’s matrix representation 
of observables. Then, with careful attention to its classical antecedent, Dirac found the 
equation governing the evolution of the matrix elements which had eluded Heisenberg 
in the operator ihdA/dt = AH — HA. He then went on to discover spinors in describing 
the relativistic electron and anti-matter implied by the quantum in relativistic space- 
time. Dirac conceived the many-time formulation of relativistic quantum mechanics and 
laid the foundations of the Feynman path integral — thereby opening the way to quan- 
tum electrodynamics. Newton synthesized the foundations of classical mechanics. In 
fitting kinship, Dirac, who did the equivalent for quantum mechanics, filled the chair at 
Cambridge held by Newton. 
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Leonhard Euler 

1707 Basel — 1783 St. Petersburg 

A watershed of classical mathematics was reached under Euler in number theory, analy- 
sis, geometry, and topology (Euler gave the solution to the famous problem of the seven 
bridges of Konigsberg). In so doing, he conceived virtually all the modern notation of 
mathematics. Euler was an important contributor to problems of physics and engineering 
including celestial mechanics, elasticity theory, and the mechanics of fluids. Maupertuis, 
the president of the Berlin academy of which Euler was the most fruitful member, had 
begun to formulate problems in mechanics as problems of minimizing path integrals of 
functions such as the velocity of a particle. Euler was the first to systematize Mauper- 
tuis ’ ideas in a calculus of variations and to create general methods for their solution. 
He showed how minimization of the integral of a function between two fixed points could 
be reduced to the solution of a differential equation, the now famous Euler-Lagrange 
equation. 
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Joseph-Louis Lagrange 

1736 Turin — 1813 Paris 

(Courtesy AIP Emilio Segre Visual Archives, E. Scott Barr Collection) 

Lagrange was born into a family of civil servants and destined for the law; but his 
natural gifts turned him to mathematics. In a youthful discovery he found that he could 
express Euler’s equation for the function minimizing a functional integral (the equation 
that would come to be known as the Euler-Lagrange equation) in a neat and compact 
form using integration by parts (illustrated in Chapter 3). Recognizing the beauty 
of this method, Euler recommended him to Maupertuis who was eager to encourage 
research in variational methods. Lagrange then generalized Euler’s variational calculus 
so that it could handle all the problems of mechanics. He refined the choice of the 
integrand of the functional appropriate to mechanics, introducing the “Lagrangian,” 
L = T — V . (Maupertuis and Euler had used what amounted to only the kinetic energy 
which then required additional side conditions.) Lagrange went on to make important 
contributions to a wide range of mathematical subjects including the general theory of 
small perturbations of the solar system. 
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William Rowan Hamilton 

1805 Dublin — 1865 Dunskirk Observatory, Ireland 

The celebrated equations that now bear his name were, for Hamilton, an adjunct of 
a more fundamental interest in optics. Like Jacobi, Hamilton knew the formal con- 
nection between point mechanics and geometrical optics for which Fermat’s principle 
corresponds to the action principle for particles and light-ray surfaces of constant phase 
(which Hamilton called his “characteristic function”) are analogous to level surfaces of 
action. He solved a variety of optical problems with the characteristic function includ- 
ing a prediction that a light-ray incident on a biaxial crystal should emerge as a hollow 
cone. Its demonstration two months later caused a sensation in the scientific community. 
Hamilton’s use of the eccentricity invariant of the Kepler problem occurred in relation to 
his application of the characteristic function to problems of celestial mechanics. Hamil- 
ton’s other beautiful discovery was that of quaternions, a four-dimensional generalization 
of the two-dimensional complex variable. The algebra of these quantities is the same as 
that of four-dimensional rotations, an amazing coincidence with the symmetry algebra 
of Kepler motion of which Hamilton was not aware. 
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Carl Gustav Jacob Jacobi 

1804 Potsdam — 1851 Berlin 

After schooling himself in the works of Lagrange and Euler while a student at Berlin, 
Jacobi secured a position at Konigsberg where a voluminous outpouring of work chiefly 
centered on elliptic integrals followed. Jacobi brought to further perfection the methods 
begun by Euler and refined by Lagrange by developing the most general transformations 
of the Euler-Lagrange equations (now known as canonical transformations) and showed 
that the trajectories could be obtained by solving a first order partial differential equation 
for the generating function of a canonical transformation as an alternative to solving the 
Hamilton equations (see Action Redux, Chapter 3). The methods of both Jacobi and 
Hamilton showed the strong formal similarity between the mechanics of point particles 
and the optics of waves. They therefore showed the way in which quantum mechanics 
and classical mechanics merge and, remarkably, presaged the development of de Broglie 
matter waves, the Schroedinger equation, and ultimately the Feynman path integral. 
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Jacob Hermann 

1678 Basel— 1733 Basel 

(Courtesy Bernoulli Edition, Basel; original in private hands) 

Jacob Hermann was born into the mathematical ferment sparked by the famous Bernoulli 
family centered in Basel. He became an early champion of the calculus of Leibniz, 
winning the favor of the master. Teaching positions in central Europe were scarce (and 
the Bernoullis who filled them numerous) so Hermann was forced to seek a post in 
Padua, Galileo’s old haunt. It was in Padua that he completed his most important work 
in celestial mechanics including the discovery of the hidden invariant of Kepler motion 
culminating in his magnum opus, the Phoronomia. He then moved to St. Petersburg 
where he worked freely with Euler and Daniel and Nicholas II Bernoulli on problems 
of algebra and motion. Hermann was well remembered and his works seriously studied 
in the eighteenth century along with those of the Bernoullis and Leibniz (his work was 
highly regarded by Immanuel Kant). He then seems to have been forgotten for 200 years 
but is now being rediscovered by the twenty-first century. 
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Wolfgang Pauli 
1900 Vienna — 1958 Zurich 
(CERN, courtesy AIP Emilio Segre Visual Archives) 

Gifted by native intelligence and family background (his father was a distinguished pro- 
fessor of Chemistry at Vienna and Ernst Mach was his godfather), Pauli was one of 
physics’ child prodigies. At the age of 19 he wrote the article on Einstein’s theory of 
relativity for Felix Klein’s Encyclopadie der Mathematischen Wissenschaften, a brilliant 
exposition which remained definitive for over half a century. Pauli remained an impor- 
tant and often severe critic of Einstein’s field theories; but his own work was centered on 
the problems of quantum mechanics. He was the first to present the complete solution of 
the quantum motion of the hydrogen atom for which he used the eccentricity invariant 
he had learned from Lenz (It is unlikely Pauli knew of Jacob Hermann’s work). The 
puzzling behavior of atoms in a magnetic field was solved by Pauli’s discovery of the 
exclusion principle and the important role of spin in the atom: the Lorentz invariance of 
space— time requires that electrons possess spin and that no two electrons can be in the 
same state. His last notable discoveries were the neutrino, required for conservation of 
energy in radioactive decay via the weak nuclear force, and the celebrated spin-statistics 
theorem. 
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Amalie Emmy Noether 

1882 Erlangen, Germany — 1935 Bryn Mawr, Pennsylvania 
(Courtesy SPL/Photo Researchers, Inc.) 

“Emmy Noether,” wrote Herman Weyl in her obituary, “was born into a Jewish family 
distinguished for the love of learning and championed by the great Gottingen mathe- 
matician, David Hilbert.” Nonetheless, she was barred from academic standing because 
she was a woman. Emmy Noether was still able to gather around her a notable group 
of students including the Dutch mathematician B. L. van der Waerden. She traveled 
extensively to other institutions and had an international array of colleagues including 
the distinguished Russian mathematician P. S. Alexandroff. Her discovery of the connec- 
tion between the symmetries and invariants of the variational calculus was stimulated by 
Felix Klein during her early years at Gottingen. She was summarily dismissed and the 
Gottingen Algebraic School disbanded when the Nazis came to power. She emigrated to 
the United States where she lived out the last year and a half of her life at Bryn Mawr 
College. 




People 



253 




Heinz Hopf 

1894 Breslau, Germany — 1971 Zollikon, Switzerland 
(Courtesy G. Alexanderson, from the Polya Picture Album ) 

Heinz Hopf was inspired by the power of algebraic methods to reveal important properties 
of geometry and topology in attending lectures on the work of L. E. J. Brouwer. Following 
his studies at Breslau he passed through the vibrant mathematical circle at Gottingen 
where he became acquainted with Emmy Noether and began a lifelong correspondence 
with P. S. Alexandroff. One of Hopf’s great discoveries, completed in 1931, was that 
of the topological relationships between the one-, two-, and three-dimensional spheres. 
(These relationships have at their core the manner in which each higher dimensional 
sphere is foliated with those of lower dimension.) The Hopf spheres are the basis of the 
symmetries of Kepler motion, though Hopf approached them as purely mathematical 
objects and was not aware of their physical connection to the motion of the solar system 
and the hydrogen atom. 
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Hedrick Antoon Lorentz 

1853 Arnheim, Netherlands — 1928 Haarlem, Netherlands 
(Algemeen Rijksarcheif, The Hague, courtesy AIP Emilio Segre Visual Archives) 

Maxwell’s electromagnetic field equations received their connection with the sources of 
these fields (the Lorentz force law) in the work of Lorentz whose substantive work was 
on the theory of the electron. Lorentz was a gifted lecturer and writer who became the 
leading statesman of the late nineteenth century world of physics; he also inspired great 
affection and was beloved by generations of students and colleagues, most notably Ein- 
stein. Lorentz discovered the transformations of space— time required for the invariance 
of the Maxwell equations leading, incomprehensibly, to the relativity of time and Lorentz 
contraction of bodies. Einstein used Lorentz transformations as the definitive insight into 
the geometric structure of space-time; but Lorentz, to the end of his life, regarded them 
as a mathematical artifice that would ultimately be reinterpreted in classical concepts 
based on absolute time. 
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Jules Henri Poincare 

1854 Nancy — 1912 Paris 

(Courtesy AIP Emilio Segre Visual Archives, Percy Bridgman Collection) 

Poincare is a pivotal figure for both physics and mathematics in the twentieth century be- 
cause he sparked a development immensely enriching both: the role of topology. It should 
not be forgotten Poincare had independently of Lorentz and Einstein found the transfor- 
mations of Maxwell’s equations which preserve their invariance and, more importantly, 
recognized their geometric and group-theoretic significance. Poincare’s geometric and 
topological insights were of a general and abstract nature embracing all manifolds, not 
just those of space-time. Thus, the phase space was also deeply interesting to Poincare 
and he initiated the systematic exploration of the connections between the topology and 
stability of mechanical systems, introducing mappings of manifolds as representations 
of the motions of mechanical systems. Topological features of mapped manifolds then 
reveal crucial qualitative properties of motion such as their stability. Poincare showed, 
among other things, that one can derive this information from a study of the fixed points 
of mappings, a powerful concept which has grown to abundant use in the late twentieth 
century. 




Notes 



Note 1 

Variational Principle 

The law of motion has a basis in a variational principle. All motion may be 
gathered up in a single mechanical quantity, the action S. The trajectories 
which particles take to get from one point in space and time to another are 
those for which the action takes the least possible value between the two 
points.* 

The statement that the action shall take a minimum value between any 
two points in space and time is simple. The final laws of motion which 
result from this statement are also simple. But the pathway of deduction 
from one to the other is not. The following is a guided tour along this 
deductive path. 

Let L = L(x a ,x a ) be the Lagrangian function of the rectangular posi- 
tions x a and velocities x a of a system of particles a = 1,2,... . The action 
is defined as the integral of the Lagrangian over the time path of the system 
between any two time instants 1 , and f 2 : 

pt2 

S= L(x a ,x a ,t) dt. (1.1) 

Jtx 

To find the functions x a (t) for which S is stationary, the functions x a (t ) 
and x a (t) are given variations which displace them to x' a (t) and x’ a (t) while 
still allowing them to pass through fixed end points (x Q )i and (x Q ) 2 . The 
time t is also given a variation which displaces it to t'. 

The displaced coordinates (x' a ,x' a ,t') are not only generated by the path 
variations Sx a , Sx a ; they are also generated by variations in the symmetry 
parameters describing the homogeneity and isotropy of space and time: <5a, 
Su, <517, <5r. The path variations are described with respect to a given 
reference frame. The symmetry parameter variations correspond to the 
variations of the reference frame. The displaced coordinates are given in 
terms of these variations by 

x' a = x a + Sx a + <5a + Sut. + <517 x x a , 
x' a = x a + Sx a + Su, ( 1 . 2 ) 

t' = t + St. 



* More precisely, the action must be stationary which, over finite segments of time, 
allows not only minima, but maxima and inflection points. However, for infinitesimal 
segments, the action is always a minimum. 
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The variation of S now takes the form 



SL(x a , x a ) dt. 



rt2~\~St2 rt\+8t\ ' ' ' 

+ / L(x a ,x a ) dt — / L(x a ,x a ) dt. 

Jt 2 Jt i 

The two terms on the second line of Eq. (1.3) appear because of the vari- 
ation of the time. 

The variation SL(x a , x a ) may be expressed to first order in the displace- 
ments of the coordinates as 

SL{x ai X a ) = ^ ■ « - x a ) + ■ (. x' a - x a )\ 



and the variations x' a , x' a , t' expressed in terms of Sx a , 5x a , St and the sym- 
metry parameters through Eq. (1.1). The variation of S given by Eq. (1.3) 
then becomes, to terms first order in the variations, 

^ + •*>)<* 

A* 2 x dL f t2 . dL 

+ / y; — — ■ (Sa + Sut + Sfl x x a ) dt + / y -ryr— ■ Sudt (1.4) 



pt2+8t2 pti+Sti 

+ / L{x ai x a ) dt - / L(x a ,x a ) dt. 

Jto Ju 



The first term on the second line of Eq. (1.4) vanishes since, as described 
in Chapter 3 in the discussion surrounding Eqs. (3.7), the Lagrangian of an 
isolated system of particles satisfies the conditions 

\ ' dL \ dL 

E^ = 0 ’ E x “ x ^ = °- 



To the approximation of first order in the variations, the last two terms 
in Eq. (1.4) reduce simply to the product of the Lagrangian and time 
variations at the end points of the path: (L6i)\t 2 — (LSt) | tl . 

The term involving ( dL/dx a ) • Sx a on the first line of Eqs. (1.4) may be 
transformed into one involving Sx a through inegration by parts as in the 
transition from Eq. (3.9) to Eq. (3.10). Substitution of these results into 
the general variation (1.4) and integration of the term d/dt(dL/dx a ■ Sx a ) 
leads to 



OLj 

+ Efl— ,5xQ 



1*2 r t. 2 



y^dt-su 



+ {LSt) \t 2 — ( LSt)\ tl . 
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It is important to observe that the boundary term in the integration by 
parts, J2 a {dL/dx a ) ■ 6x a \£, does not now vanish when the time t as well 
as the paths x a are varied. The requirement that the varied paths all pass 
through the same end points is that x' a \ t^+st! = x Q k, not x' a \ tl = x Q | fl . 
Thus the path variation at an end point, such as dx Q | tl , does not vanish as 
it does in Eqs. (3.9) and (3.10) where the times are held fixed and symmetry 
parameter variations are not considered. 

The variation at fi such as (>x Q | tl contains a contribution due to the 
variation of the time at the end point given by 

Sx a k+5ti <5x a k -T (x a 6tij k . 

A similar connection exists between Sx a \ t2+ st 2 and Sx a \ t2 . These results 
and the condition 5t\ = (5t 2 = St from the third of Eqs. ( 1 . 2 ) may be 
incorporated into Eq. ( 1 . 5 ) resulting in the action variation 




where A indicates the difference of values between end points of the path 
of any quantity before which it stands: A() = () 2 — ()i- 

The path variations Sx a at the end points which appear in the term 



A 




dL/dx a ■ Sx a 



on the second line of Eq. ( 1 . 6 ) do not vanish in the presence of symmetry 
parameter variations. Since the end points of the trajectories are fixed as 
Xak+ati = and x' a \ t2+ st 2 = x a |t 2 , the end point variations of the 

paths must be canceled by the variations of the symmetry parameters or, 
according to Eqs. ( 1 . 2 ), 

<>x a k = -6a- Sut! - Sfl x x Q k , 



with a similar condition for the end point at t 2 - Substitution of these 
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conditions into Eq. (1.6) yields the action variation 




The variation of the action 6 S thus results from the trajectory variations 
5x a , the origin shift 6 a, the space translation Su, the space rotation Sfl , 
and the time shift St. These variations are arbitrary and independent of 
one another. Hence, their coefficients must vanish if the variation SS in 
Eq. (1.7) is to vanish. 

A vanishing variation SS corresponding to the variations Sx a results in 
the equations of motion 

d dL dL 

dt dx a dx a 



The last four lines of Eq. (1.7) are in the form of conservation laws: 
a dynamical quantity is unchanged between any two arbitrary end points 
of the path of motion. The first conserved quantity corresponds to the 
invariance of the law of motion to arbitrary shifts of the origin of the space 
coordinates: 




( 1 . 8 ) 



The second corresponds to invariance to velocity space translations 



a / dL \ f^ 2 dL 

A ( L alT* ) / . LI AT - dt = °- 



' 1 1 



(1.9) 



The third corresponds to invariance to space rotations: 

A (S>* !£)=«• i 1 - 10 ) 

x a / 



The fourth corresponds to invariance to shifts in the origin of the time 
coordinate: 




= 0. 



( 1 . 11 ) 
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The Lagrangian and its derivatives which appear in these expressions are 
recognizable as the total momentum P, angular momentum J, and energy 
H of the system: 



P = E 

Ot 



dL 
dx Q ’ 



J = Il x ‘*§r- 



( 1 . 12 ) 



Since the momentum is p a = dL/dx a = m a dx a /dt, the second term on 
the third line of Eq. (1.7) has the integral 




dt = y^(m G 



*)2 - XX 



m n x n 



A(toX), 



(1.13) 



where m = m a is the total mass of the system and X is the coordinate 
of the center of mass defined by mX = m a x a . The third line of 
Eq. (1.7) therefore involves the conservation law for the mass-center N 
defined as 

N = — — f — m a x a = Pt — mX. (1.14) 

' ox a ' 

a a 

The general variation (1.7) in these variables corresponding to the sym- 
metry parameter variations is then 



^symmetries = “A P ■ 5a — AN ■ 5u — A J ■ 5 ft — AH5t. (1.15) 

The overarching condition that action shall be a minimum leads to two 
kinds of results: equations of motion and conservation laws The conserva- 
tion laws are a result of a vanishing action variation due to the symmetry 
parameter variations given by Eq. (1.15). These flow from the symmetries 
of the Lagrangian and reveal the invariants of motion. For Galilean rela- 
tivity (1.2) these invariants are P, IV, J, and H. They are ten in number 
and correspond to the ten symmetry parameters of the Galilean group. 



Note 2 

Closed Orbits of Two-Body Motion 

The bound motion of two bodies takes place in an effective potential: 

W{r) = M 2 / 2 pr 2 + V(r). ( 2 . 1 ) 

The potential V (r) may be quite general. However, of this general fam- 
ily of potentials, only two potentials generate orbits that are closed: the 
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attractive inverse first power potential V = —k/r (corresponding to the 
inverse-square force f = —kr/r 3 ) and the attractive quadratic potential 
V = nr 2 / 2 (corresponding to the linear force f = —nr). 

If the motions are bound, the potential W (r) which corresponds to V (r) 
must possess a local minimum W (ro) = So at the point r = ro and a positive 
second derivative W"(r) > 0 as shown in Fig. 2-1. Turning points of the 
radial motion (f = 0) correspond to the intersections of the potential curve 
W(r) with the energy S. These intersections are the roots of the condition 

£ = W(r) = M 2 / 2 pr 2 + V(r). (2.2) 

The radial motion of the orbit consists of a libration “to and fro” between 
these two turning points designated r = r m i n and r = r max . For r m i n = 
v max = fo, the radial motion vanishes and the orbit is a circle of radius ro- 
An orbit will be closed if the angle A through which the position vector 
turns from r = r m ;„ to r = r max and back again is a rational fraction of 
2-7 t: A(f> = 2Tnn/n where in and n are integers. In that event, after n 
revolutions of the angle (f > , the radial coordinate will have completed m 
libration periods corresponding to m complete rotations of the position 
vector. The position vector will then be back precisely at the point from 
which the motion started bringing closure to the orbit. The behavior of the 
angle A(j> reveals the closed and open nature of the orbit. 

The orbit angle is given by Eq. (3.50) and the angle corresponding to 
one libration period is 



Acj) = 2 



M/r 2 

yJM£ - W) 



dr. 



(2.3) 



The angle integral can also be expressed in terms of the perpendicular 
momentum p± = M/r of Eq. (4.4) in terms of which it becomes 



A(j)=2 



rP J-n 



dp± 

sj2p(£ - W ) ' 



(2.4) 



The condition (2.2), in terms of r = M/p±, becomes 
W(M/p ± ) = p\/2p + V(M/p ± ). 

The libration limits of the perpendicular momentum follow directly from 
Eq. (2.2) and are the roots p± = pj_ m i n and p j_ = pj_ ma x in 

£ =p 2 ± /2p+V(M/p ± ). (2.5) 



First examine the orbits close to the minimum energy point r = ro- 
Such orbits are near-circular orbits. At the minimum energy point the 
first derivative W'(ro) vanishes. According to Eq. (2.1), a vanishing first 
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Energy 




r 



Energy 




r 



Figure 2-1. Effective Potentials for Two-Body Motion. Bound motions take 
place centered about an effective potential W (r) which is convex with a positive second 
derivative W" > 0. The points r m i n and r ma x correspond to the intersections of the 
potential energy curve with the total energy E. The radial motion consists of a libration 
“to and fro” between r m [ n and r max . The point 7*o corresponds to the minimum energy 
point W(ro) = Eo where the first derivative W'(ro) vanishes. In (a) the potential W(r) 
for the attractive direct power-law potential V (r) = Kr & for k > 0, (3 > 0 is illustrated. 

In (b) W(r) for the attractive inverse power-law V(r) = —kr@ potential corresponding 
to (3 < 0 is illustrated. Bound motions only exist for E < 0 in this case. 

derivative of W(r) imposes a condition on the first derivative V'(r). The 
conditions at the minimum point are 

W'(r 0 ) = 0, roV'(ro) = M 2 //zr 0 2 . 

The potential may be expanded about the minimum point as 

W(r) =£ 0 + |W"(r 0 )(r - r 0 ) 2 H . 

The libration point condition possesses a similar expansion: 

£ = W (r max ) = £ 0 + |TT ,, (?’ 0 )(r max - r 0 ) 2 H . 
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[Note that linear terms in W'(rV)(r — ro) do not appear in these expansions 
since W (ro) vanishes at the minimum energy point.] 

If one keeps only terms up to second order in S — W, the energy is 
symmetric about the minimum point with r max — ro = — (r m i n — r o) = 
\/2(£ — So) /W" . The angle ( 2 . 4 ) for near-circular orbits is then 



A0 = 4 



I M 2 //j,ro 2 
r 0 2 W" 




?'o dr 

V / ( 7 'max-r 0 ) 2 -(r-r 0 ) 2 



which is 



A <f> = 2 tt\ 



I M 2 / )iro 2 
r 0 2 W" ' 



The second derivative W" and the term M / firp 2 may be eliminated in favor 
of derivatives of the potential V(ro) through Eq. (2.1): 



M 2 /nr 0 2 = r 0 V’, W" = W /r 0 + V " , 



resulting in the angle 



I V’ 

Ad> = 27 r \ / . 

v V r 0 V" + W 

For arbitrary potentials, the angle depends upon the size of the circular 
orbit ro about which the near-circular orbits exist. If the angle is to be 
independent of the size of the orbit, the potential V(r) must satisfy the 
differential equation 

rV" + 3 V 7 



where a > 0 is a constant. The solutions of this differential equation are 
V(r) = kr 13 , (3 ^ 0; V(r) = fclnr, (3 = 0. (2-6) 



The exponent (3 > — 2 is related to the constant a as (3 = (1 — 2a) /a. Only 
power-law and logarithmic force laws yield angles which are independent 
of the size of the orbit. The corresponding angle for all near-circular orbits 
which are independent of size is 



A(j> 



2n 

VP + 2 



( 2 . 7 ) 



Since the logarithmic potential (3 = 0 leads to an angle A <f> = V^ 
which is an irrational multiple of n, closed orbits can exist only for power- 
law potentials V = kr@ . But this still leaves a wide range of values of 
the exponent (3 > —2. Since these conditions pertain to orbits which are 
near-circular, they are necessary rather than sufficient. To find sufficient 




Note 2. Closed Orbits of Two-Body Motion 



265 



conditions, orbits in other limits which deviate from circularity must be 
examined. 

Consider all direct power-laws V = nr 13 , (3 > 0. These orbits deviate 
from circularity by the maximum amount in the limit £ — > oo. In this limit, 
the roots of the turning point condition ( 2 . 5 ) become 

Pimax/ 2 M->£, K^/Pi 



and the integrand 1/ -^/2p(£ — W) will vanish except in the regions p± ss 
p_i_mm and p_ i ss p_i_ max . Since these regions are well separated from one 
another, the angle integral may be broken into two parts and the lower and 
upper limits extended to zero and infinity respectively: 



A0 = 2 



dp . l 



-2 



dp± 



0 \ZTPl_ max P. l) dp- Lmin yj 2pnMf ) {j) 1 ^ nin — p ± 13 ) 

The integrals may be rearranged to 



A<f> = 2 



df 



- 2 



(/3+2)/2 ,.oo 
P _Lmin / 



df 



y/2nKMP J 1 y/1 - f-f 3 



( 2 . 8 ) 



with the substitution £ = pj_/p_Lmax in the first and f = p±/p± m in in the 
second. In the limit £ — > oo, there results p_Lmin — y 0. The second integral 
of Eqs. ( 2 . 8 ) therefore makes a vanishing contribution and the first becomes 



A(f> = 2 



df 






( 2 . 9 ) 



The two conditions ( 2 . 9 ) and ( 2 . 7 ) may be combined to yield the single 
condition tt = 2n/y/P + 2 or ft = 2. Only the direct power force law 
V = nr 2 yields closed orbits. 

Consider now the inverse-power potentials for V = — kr @ for which —2 < 
f3 < 0. In contrast to the direct power-law potentials which are positive 
for bound motions, the inverse power-law potentials must be negative. The 
circular orbit for this potential occurs at the minimum energy point £o ^ 0. 
The orbit which deviates most from circularity is that for £ = 0. For 

£ = W{M/px) = p\! 2 ii + kM 0 pf t3 , 



the roots of the turning point condition at £ = 0 are 

P± min = 0, P±max = (2 ( 2 . 10 ) 



The angle integral is then 



A(f=2 



/•Pin 



dp± 



2 m( 



kM^p^ _ p 2 ^/2pj 
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The term kM@ may be eliminated in favor of pj_ m ax through the second of 
Eqs. (2.10) and the angle integral simplified to 

A4, = 2 l! = 2 '( 2 + «• (“D 

Combining Eq. (2.11) with Eq. (2.7), one finds 27r(2 + f3) = 27r/y / 2 + j3 
and the condition /3 = — 1. The only inverse power potential which yields a 
closed orbit is the potential V = —k/r. Closed orbits for two-body motion 
therefore exist only for the potentials V = nr 2 and V = —k/r where k and 
k are positive coefficients. 



Note 3 

The Eccentricity Invariant 

Two-body motion for which the force between the bodies is the inverse- 
square force / = — k/r 2 possesses an isolating invariant beyond the Galilean 
conservation law invariants. This invariant is the eccentricity 

e = p x M/kp — r/r. (3-1) 

The eccentricity vector lies in the plane of motion perpendicular to the 
angular momentum vector. Its magnitude e 2 = 1+2HM 2 / k 2 p = 1 —M 2 /h 2 
where h = ^k 2 p/{—2TL) is the Kepler constant appears in the family of 
conic sections in (r, <f>) coordinates: 

r(4>) = r 0 /(l + ecos</>). 

The properties of the eccentricity vector may be revealed through the 
Poisson bracket. If e is an invariant, its Poisson bracket with H must 
vanish. To show this, form the bracket of Eq. (3.1) with 7 i and use the 
property of Poisson brackets (which follows from the rules of differentiation) 
[. AB , C\ = A[B, C\ + B[A, C\ to obtain 

[e, H] = p x [M, H]/k^i + [p, H] x M/kp - [r/r, TC\. 

Since M is an invariant, it satisfies [M,H\ = 0 and the bracket [e,H\ 
reduces to 

[e,7f] = [p,H] x M/kp - [r/r, H\. (3.2) 

The Poisson bracket of any dynamical variable with Ti generates its time 
evolution. Hence [p,7f] generates Newton’s law of motion in Hamilton’s 
form [p,7f] = p = {. Represent the force as a general central force f = 
f(r)r/r and the angular momentum as M = rxp. Equation (3.2) then 
becomes 



[e,H] = r x (r x r)f(r)/kr — [r/r, Ti}. 



(3.3) 
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Both the double cross-product r x (r x r) and the Poisson bracket [r/r, TL\ 
appearing in Eq. (3.3) are actually different representations of the total 
time derivative of the unit position vector r/r. The Poisson bracket of the 
unit position vector with the Hamiltonian is manifestly so: 

-^(r/r) = [r/r, TC\. 

To see the connection between the double cross-product and the time 
derivative of the unit vector, expand it as 

x (r x r)^j = ( X{Xj — r 2 Sij)ij. (3-4) 

The time derivative of the unit vector may be formally calculated from 
Xi/r = Xi/ y/XjXj and, aside from sign and the scalar factor r 3 , yields the 
same result, 

— ( Xi/r ) = -( XiXj - r 2 Sij)xj/r 3 . (3.5) 

One therefore finds the elegant identity 

— (r/r) = — r x (r x r)/r 3 . (3.6) 

Combining the results of Eqs. (3.4)-(3.6) with Eq. (3.3), one obtains 

[e } n\ = — (l + r 2 /(r)/fc) ^ (r/r) . (3.7) 

For an attractive inverse-square central force /(r) = —k/r 2 , the right-hand 
side of Eq. (3.7) vanishes revealing the invariance of the eccentricity vector. 

Let us now consider the Poisson brackets of the eccentricity. The Poisson 
bracket of any two components of the eccentricity is 

h, ej] =[(p x M)i/kn , (p x M)j/kfi\ - [(p x M)i/kfi , Xj/r] 

+ [(p x M)j/kn,Xi/r ] + [xi/r,Xj/r]. 

The last term in Eq. (3.8) vanishes. The remaining terms may be estab- 
lished by direct calculation; but some simple theorems for Poisson brackets 
simplify the work. The Poisson bracket of a vector product A x M and a 
vector B can be shown to be 



[(A x M)i, Bj] — tiki [Afe, Bj]Mi — AjBi + (A ■ B)Sij. (3.9) 

One may use this result again to show that the Poisson bracket of a vector 
product with the angular momentum is 



[(A x M)i, (A x M)j] 

— ^ikl^jmn [A fc , A m ]MiM n - A 2 e ijk M k . (3.10) 
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Using (3.9) with A = p, one finds 

[(p x M)i/kfj,,xj/r\ - [(p x M)j/kfi,Xi/r \ = 2e ijk M k /kpr. (3.11) 

The result (3.10) may be used with A = p, and the fact that \p k ,Pm\ = 0 
to show that 

[(P x M) h (p x M)j] = -p 2 e ijk M k . 

The combination of these results yields 

[e», ej] = -( p 2 /k 2 p - 2 /kr)e ijk M k = -(2 H/k 2 p)e ijk M k , 

where H = p 2 / 2p — k/r is the total energy. The Poisson bracket [e 2 ,ei] is 
simply 2ej[ej,ei\. Hence, the eccentricity Poisson brackets are 

[e*,ej] = -(2 H/k 2 p)eij k M k , [e 2 ,e, : ] = {4H/k 2 p.)e ijk ejM k . 

The Poisson brackets of the eccentricity and angular momentum follow 
directly from the cyclic reproduction properties of the angular momentum: 

[e*, Mj] = €ij k e k , [M 2 , e»] = 2e ijk e j M k , [e 2 , Mf\ = 0. (3.12) 

The eccentricity e may be expressed in terms of its dimensional form, the 
eccentrum E, which has the same dimensions as the angular momentum: 

E = he. 

The components of E are real for bound motions (TL < 0) and imaginary 
for free motions {TL > 0). The Poisson brackets for E and M are 

= e.ij k M k , [Ei,Mj\ = Cij k E k , [£),£),■] = e.j j k M k . (3.13) 

As described in Chapter 4, the two three-vectors M and E represent 
four-dimensional rotations. Four-dimensional rotations may also be rep- 
resented by an antisymmetric four-matrix. Because of its antisymmetry, 
such a matrix has only six (rather than sixteen) independent components; 
and these six components are the six components of the two three-vectors 
M and E. An antisymmetric matrix F lx \ can be built from the two 
three-vectors as F 0 j = —Fj 0 = Ej for j = 1,2,3, and F^ = e t j k M k for 
i,j, k = 1,2, 3. The matrix F^\ is a unification of the eccentrum and angu- 
lar momentum in a single mechanical object: 



f 0 


Ei 


e 2 


e 3 


-E x 


0 


M 3 


— m 2 


-e 2 


-M 3 


0 


Mi 


\-e 3 


m 2 


-Mi 


0 



(3.14) 
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The Poisson brackets of the matrix F^x may be found from Eqs. (3.13) 
and are the matrix form of the cyclic reproduction property of rotational 
symmetry: 

[F^x,Fx v ] = F^. (3.15) 

The Poisson bracket (3.13) or (3.15) whose cyclic structure indicates a 
rotational symmetry show that the eccentricity and the angular momentum 
invariants E and M or, more compactly, the matrix F^x represent the 
group of rotations in four dimensions just as the angular momentum is a 
representation of the group of rotations in three dimensions. 

The matrix representing rotations in four-space is actually made up of 
two kinds of vectors. An ordinary vector changes sign if the coordinates 
are inverted, that is, x — > — x. Such vectors have come to be called polar 
vectors. However, a vector which arises as the cross-product of two polar 
vectors has the interesting property of not changing sign if the coordinates 
are inverted (the product of two vectors which change sign does not itself 
change sign). Such vectors are called axial vectors. 

In the language of parity, a polar vector has odd parity whereas an axial 
vector has even parity. A matrix representing rotations in four-space is 
made up of a polar vector E and an axial vector M. The eccentrum E is 
a polar vector because the term p x M does change sign upon coordinate 
inversion since M is an axial vector. The vector product p x M is the 
product of a polar and an axial vector. Hence, the invariants of the two- 
splreres supporting the four-dimensional rotational symmetry of the motion 
are divided into even ( M ) and odd (E) parity. 

The matrix F /X x exists independently of any particular coordinate sys- 
tem. Although the components representing a matrix such as F M x will 
change in the passage from one set of coordinates to another, a matrix pos- 
sesses certain quantities which are invariant to coordinate transformations. 
For F^x, the matrix invariants are M 2 + E 2 and M ■ E. These are precisely 
the Casimir invariants which appear in the symmetry conditions (4.8) when 
e is expressed in terms of E: 

M 2 + E 2 = h 2 , M ■ E = 0. (3.16) 



The reader familiar with the electromagnetic field will recognize the ma- 
trix F u x in the case of unbound motions as the image of the electromagnetic 
field-strength tensor (in its form in Minkowski space-time) 



1 0 


iE\ 


iE 2 


—iEi 


0 


b 3 


-iE 2 


-b 3 


0 


\-iE 3 


b 2 


-Si 




(3.17) 



with the role of the electric field iE played by the eccentrum E and that 
of the magnetic field B played by the angular momentum M. 
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The electromagnetic field equations are invariant to the transforma- 
tions of Lorentzian space-time (6.4). There is an elegant representation of 
Lorentzian space-time proposed by Herman Minkowski in which the field- 
strength tensor and the electromagnetic field equations may be expressed. 
In Minkowski’s representation the time component of the four- vector of an 
event is made imaginary as x$ = ict. The imaginary time component of 
the four- vector x = (ict,x\,X2,xf) then turns the Euclidean magnitude of 
a vector |x| 2 = X\X\ into the Lorentzian magnitude (save for the overall 
sign): 

|cc| 2 = -(cf) 2 + X\ 2 + X2 2 + X3 2 . 



Maxwell’s equations of electromagnetic theory can be written in Minkowski 
space as 



dF^x 
dx M 



Jx, 



where J\ is the four-current of charge. 

It is well known that the invariants of the electromagnetic field-strength 
tensor are E 2 — B 2 and E B. These invariants match the Casimir invariants 
for unbound motions of the two-body problem given by Eq. (3.16) when 
E and M appearing in Eq. (3.14) are related to the electromagnetic field 
variables as 

E — > iE, M — > B. 



Note 4 

Orbit Integrals of Kepler Motion 

The trajectory equations of Kepler motion drawn from the action result 
in the integral for the orbit in space relating r and <f> given by Eq. (4.50) 
and that for the orbit in time relating r and t given by Eq. (4.51). 

The orbit in space (4.50) is integrable in elementary trigonometric func- 
tions: 

r = ro/(l + ecos(j>). (4.1) 



This is the general equation for a conic section (Fig. 4-1) with the origin 
of coordinates r, <f> at one of the foci. The rectangular coordinates of the 
conic, 

x = r cos </>, X2 = r sin <f>, 



are given by 



?’o cos (j) 

1 + e cos <j> ’ 






ro sin </> 

1 + e cos (j> 



(4.2) 



It is readily observed from Eqs. (4.2) that in these coordinates all conics 
cross the X 2 axis at the same points <f> = ±7t/2 for which x\ = 0, X 2 = ±ro 
independently of e. They also all cross the x\ axis at x\ = r m j n = ro/(l+e); 
but this point depends on e and is different for different conics. 
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The bound orbits which correspond to the circle and ellipse cross the x\ 
axis at two points. The first which is closest to the focus at the origin is at 
(j) = 0 and corresponds to X\ = r m j n or 

(^l)min = ^min = ^o/(l T c). (4-3) 

The second which is farthest from the focus at the origin is at = n for 
which 

(^l)max = ^max = ^o/(l ^). (4.4) 



X2 




Figure 4-1. The Family of Conic Sections. Conic sections are described by two 
parameters such as the semi-latus rectum ro and the eccentricity e. In a coordinate 
system whose origin is at the focus, all conics cross the X 2 axis at a distance ro from the 
origin. The circle corresponds to the eccentricity e = 0, radius ro, and both foci lie at the 
origin. The ellipse (0 < e < 1) has one focus at the origin and the other displaced from it 
by 2ae where a = r o/(l — e 2 ) is the semi-major axis of the ellipse. The parabola (e = 1) 
separates closed conics from the hyperbolas which are open. The parabola crosses the 
x\ axis at a distance ro/2 from the origin. Hyperbolas stream to infinity crossing the x\ 
axis at distance a/(l + e) from the origin. 



The center points of the bound orbits occur at the points where X 2 
reaches a maximum or minimum. Setting 

dx 2 cos (j) + e 

dcj) (1 + ecos0) 2r ° 

one finds cos</> cent er = — e and cos0 max = 1. The x 2 coordinates of the 
centers of the bound orbits and the maximum values of X 2 directly follow 



as 



(*^2)center — ^T()/(l ^ )) (*^2)max — ^o/V^f • 



(4.5) 
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The results (4.3)-(4.6) along with the condition that all conics cross the 
x-i axis at ?’o may be used to establish the properties of the various cases. 
Circle (e = 0). The circle crosses the X\ axis at ro; it also crosses it 
symmetrically at X\ = — ro and has radius a = r o. 

Ellipse (0 < e < 1). From the two crossing points of Eqs.(4.3) and (4.4) 
one finds the semi-major axis of the ellipse is 

C = (^min T ^max)/2 — ^o/(l ^ )• 

The semi-minor axis is equal to (a^max from the second of Eqs. (4.5): 

b = ?’o / \/l — e 2 . (4.6) 

The ratio of the minor to major axis follows as 

b/a = \J\ — e 2 . 

Parabola (e = 1). The parabola separates the conics which have a finite 
extent (circle and ellipse) from those which stream to infinity (hyperbolas) . 
The parabola crosses the xi axis at x\ = r m - ln = ro/2. 

Hyperbola (e > 1). The hyperbolas cross the x\ axis at only one point 
at xi = r min = r 0 /(l + e). 

Note 5 

Inversion of the Kepler Equation 

The time integral of the orbit results in Kepler’s equation relating the 
eccentric anomaly if and the time t : 

ut = ip — esinip. (5-1) 

Newton showed that the solution of Kepler’s equation ip = ip(t) is tran- 
scendental: it is represented as an infinite series in t. This series may be 
expressed as a Fourier series: 

OO 

ip(t) = ut + a m (e) sinmLot, (5-2) 

m= 1 

with the Fourier coefficients a m given in terms of ip by 

1 f 27r 

a m = — (ip — 6) sin m9 dd , 

k Jo 

where 9 = Lot. Integration by parts of this expression results in 

(ip -9) 2tt i 1 r ^ 

a m = cos m.6 A / cos m9 dip / cos m9d9. 

mn o imr J 0 mir J 0 
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The first and third terms on the right vanish. Replacing 9 = u>t by e sin ip 
from Eq. (5.1) one is left with 

1 f 2 ” 

a m = / cos (imp — me sin ip) dip. (5-3) 

inn J o 

This integral is a well known function in mathematical physics known as 
the Bessel function: 

1 r 2 * 

J m {z ) = — cos (imp — z sin ?/0 dip. 

2?r J o 

The integral (and the Bessel function it represents) can also be represented 
as the infinite series, 



Jm{z) — 

fc= o 



(- T) k (z/2) m+2k 
k\(m + k)l 



(5.4) 



The Fourier coefficients are therefore Bessel functions: a m = 2 J m (me) / nr, 
and the solution of Kepler’s equation is expressible as a series in Bessel 
functions, 

ip(t) = tut + 2 Y s j n mL jt. (5-5) 

m— 1 



Note 6 

Kepler Motion on the One-, Two-, and Three-Sphere 

Kepler motion exists in a four-dimensional phase space with coordinates 
0 = (Ci,C2,C3,<4)- Because of the rotational symmetry of Kepler motion, 
the flow lies upon the three-sphere 

Ci 2 + C 2 2 + C 3 2 + CT = const. 

Moreover, because of the exceptional nature of four-dimensional space, the 
motion may be projected from this three-sphere onto a pair of families 
of two-splreres. Beyond that, one discovers this phase space is also the 
phase space of the Hooke oscillator: Kepler and Hooke motion may be 
transformed into one another! Here is how all these remarkable features of 
Kepler motion are interwoven. 

Kepler motion is most naturally exhibited in the parabolic coordinates 
unique to the inverse-square force (recall that splrerical-polar coordinates 
are not unique to the inverse-square force but are appropriate to all central 
force laws). The Kepler Hamiltonian (4.38) may be expressed in these 
coordinates as 
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The relative motion of the Kepler problem is six-dimensional. Since M3 is 
an invariant, the polar mode to which it belongs is immediately integrable 
and leads to a reduction of two dimensions in the phase space as described 
in Chapter 3 . This integrable mode has the polar action J $ and angle 
variable a $ = (f. The action is the invariant = M3 and the phase 
velocity is = 8H/dJ $ = M3 / fj.fr/ so that the separable mode is described 
by Eqs. ( 3 . 32 ) as 



<t> = ( M 3 /fjfr])t + const, = M3. 

One may remove the polar mode from the Hamiltonian by aligning the 
coordinates in the natural directions of Kepler motion in parabolic coordi- 
nates. These are the three orthogonal directions formed by the vectors M, 
e, and M x e of Fig. 4 - 11 . Let the X3 axis of the coordinate system be 
aligned with e. Then M3 = 0 and the Hamiltonian becomes 

4 9 k 

K = 7 — {&V 2 P + VP^/lfi) - ( 6 . 1 ) 

(Notice that this is in contrast to the development of the orbits in spherical- 
polar coordinates of Fig. 4-11 for which M was aligned with the X3 direction 
and 63 = 0.) 

With removal of the polar mode, the phase space is reduced to four di- 
mensions with coordinates (C, The Hamiltonian ( 6 . 1 ) is actually 

the much-disguised equation of a three-sphere showing that this space is 
rotationally symmetric. To see that Eq. ( 6 . 1 ) is the equation of a three- 
sphere, introduce the “squared” coordinates, 

qi 2 =f = r + x 3 , q2 2 = V = r-x 3 , 



in favor of f and 77 into the Lagrangian corresponding to the Hamiltonian 
( 4 . 38 ). The conjugate momenta are 

Pi = dL/dqi = 2y/fpz, p 2 = dL/dq 2 = 2^/r/p v , 

and the Hamiltonian is 



H = 



1 



(pi 2 /2fi+p 2 2 /2fi) - 



2 k 



2 1 2 V^- 1 - / r ' r* / r / 2 1 2 ’ 

qi z + q2 z qi z + q2 z 

which upon multiplication by (<Zi 2 + 92 2 ) becomes 

pi 2 /2fi + p 2 2 /2fj + (-2W)(<Zi 2 /2 + q 2 2 /2) = 2k. 
Rescaling the coordinates to 



( 6 . 2 ) 



Cl = q\ \fkfj/2, 
C 2 = q 2 \Zkfi/ 2 , 



C3 = Pi \/fc/(- 4 H), 

C 4 =P2y/k/(-4H), 
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and introducing the Kepler constant h 2 = k 2 p/(—2H), the equation of a 
three-sphere of radius \j2h embedded in four dimensions results: 

Cl 2 + C2 2 + C3 2 + C4 2 = 2 h 2 . (6.3) 

Kepler motion therefore lies upon a tlrree-splrere. But now a striking thing 
has happened. One finds in passing that Eq. (6.2) is also the Hamiltonian 
of a Hooke oscillator with Hamiltonian ^Hooke = 2 k and force constant 
k = — 27iKepier- Hooke and Kepler motions (the only two-body motions 
with closed orbits) are mutually transformable into one another! 

Let us now see how Kepler motion on the three-sphere projects onto 
two-splreres. This projection is a topological feature of rotations in four- 
dimensional space and their corresponding three-, two-, and one-spheres. 
The beautiful family of topological relationships between the three-sphere 
and its lower dimensional spheres was worked out by the topologist Heinz 
Hopf in the nineteen thirties and is known as the Hopf fibration* Begin 
by observing that the four coordinates in the equation of the three-sphere 
(6.3) can be written as a pair of complex variables, 

zi = Ci + < 3 , z 2 =(2 + 1(4, (6.4) 

in terms of which the Kepler Hamiltonian in the form (6.3) becomes 

ki| 2 + M 2 = 2 h 2 . 

Let w = (wi,w 2 ,ws) be the coordinates of the three-dimensional space in 
which a two-sphere is embedded. The projection of the three-sphere onto 
a pair of two-spheres graphically imaged in Fig. 4-4 is analytically imaged 
in the mapping 

W+ = (w 1 ,w 2 ,w 3 ) 

C = (Ci, C2, C3, C4) ^ , 

W~ = (wi,W 2 ,W 3 ) 

where w + and w~ are the coordinates of the two three-dimensional spaces 
into which the four-dimensional space projects. The Hopf map is most 
simply expressed in the complex representation 24 , z 2 - 

w 1 = z^z 2 + z x z 2 , w 1 = z\z 2 +z\z 2 , 

w + : w 2 = i{z 3 z 2 — z x z 2 ), u>~ : w 2 = i(z 3 z 2 — ZiZ 2 ), (6.5) 

w 3 = z{z x -z 2 z 2 , w 3 = zlz 1 -z 2 z 2 . 



* See, for example, H. Seifert and W. Threlfall, A Textbook of Topology, New York: 
Academic Press, 1980 and Chapter III of Raoul Bott and L. W. Tu, Differential Forms 
in Algebraic Topology, New York: Springer-Verlag, 1982. 
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Equations (6.5) contain two sets of coordinates. Each set satisfies the 
equation of the two-sphere, 

Wl 2 + w 2 2 + w 3 2 = (N 2 + N 2 ) 2 = 4ft 4 . 

The sets are the same except that z 2 in w + is interchanged with z 2 in w~; 
and it is this interchange that distinguishes one map onto the two-sphere 
from the other. The switch from z 2 to corresponds to a switch in the 
sign of £4 which means the two sets correspond to the pair of families of 
two-spheres into which the three-sphere projects. One set covers the three- 
sphere over the range £4 > 0 while the other covers the range £4 < 0. 

In a beautiful unfolding, the three-sphere of Kepler motion is mapped 
onto two-splreres, and, accordingly, the four-dimensional rotational sym- 
metry of Kepler motion is made equivalent to a pair of three-dimensional 
rotational symmetries. In the process Kepler motion is mapped into Hooke 
motion. 

Kepler motion is a bound integrable motion. It therefore consists of a 
closed orbit lying upon a three-torus. But the motion also lies upon a three- 
sphere. In general, it is not possible for motion to both lie upon a sphere 
and a torus; but in the amazing case of the three-sphere it is possible. Sim- 
ple orthographic projection maps the three-sphere onto a pair of families 
of two-splreres as shown in Chapter 4. Remarkably, it is also possible to 
map the three-splrere onto a family of nested two-tori (and two-exceptional 
circles corresponding to the poles). Each of the two-tori which foliate the 
three-splrere maps to a circle lying on a two-sphere. Thus, in addition to 
its foliation by a pair of families of nested two-splreres, the three-splrere 
may be foliated with a family of nested two-tori. The spherical foliations 
portray the rotational symmetry of Kepler motion while the toroidal folia- 
tions portray its integrability. Since integrability is the result of isolating 
invariants and the invariants flow from the symmetry, the two features are 
fully united in the three-splrere.* 



Note 7 

The Dirac Postulate 

P. A. M. Dirac conceived a brilliant postulate which links the commu- 
tator of quantum operators and the Poisson bracket of the corresponding 
dynamical quantities which these operators represent. This postulate is the 
critical link in the evolution of classical mechanics into quantum mechanics. 
Here is Dirac’s reasoning. 



* A detailed exposition of the way in which the three-sphere may be foliated with two- 
tori illustrated with a gallery of computer-generated images may be found in Hiiseyin 
Kogak et. al., “Topology and Mechanics with Computer Graphics,” Adv. Appl. Math., 
7, 282-308 (1986) and Hiiseyin Kogak and David Laidlaw, “Computer Graphics and the 
Geometry of S 3 ,” Mathematical Intelligencer 9, 1, 8 (1987). 
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dA dB dB dA 

dqi dpi dqi dpt 



The Poisson bracket of any two quantities A, B is 

[AB} = 

Poisson brackets behave as 

[A,B] = -[B,A], [A,A\ = [B,B] = 0. 

Poisson brackets also possess the properties 

[A\ + A2, B\ = {A\, B\ + [A2, B], [A, B\ + B2] = [A, B{\ + [ A , B2], 

which flow directly from the behavior of derivatives. Using these properties, 
the Poisson brackets in which one of the arguments is a product may be 
shown to be 

[A1A2, B] =[Ai, B]A 2 + Ai[A 2 , B], (7 1 ) 

[A,B 1 B 2 ]=[A,B 1 }B 2 + B 1 [A,B 2 }. 

The Poisson brackets in which both arguments are products A1A2 and 
B 1 B 2 can be formed in two ways. Using the first of Eqs. (7.1), there results 



[AiA 2 , BiB 2 \ = [Ai, Bi]B 2 A2+Ai[A2, Bi]B 2 

+B±[Ai, B 2 \A 2 + AiBi[A 2 , B2}, 

and using the second of Eqs. (7.1), 

[AiA 2 , B1B2} = [Ai, B 1 ]A 2 B 2 +Ai[A 2 , Bi]B 2 

+B\[A\, B 2 \A 2 + BiAi[A 2 , B 2 \. 

Since these two expressions are equal, it follows that 

{A\Bi — BiA\) _ {A2B2 — B2A2 ) 

[ApBr] ~ [A 2 ,B 2 ] 



(7.2a) 



(7.2b) 



(7.3) 



Now the pairs A\,B\ and A2,B2 are independent of one another. Hence, 
each of the two terms in Eq. (7.3) is independent of the other. They must 
each be the same constant. Thus, for any A, B it is true that 



(AB-BA)= 1 [A,B} i 



(7.4) 



where 7 is a universal constant. 

The constant in Eq. (7.4) can be shown to be pure imaginary with the 
following argument. Let C = A + iB be a complex quantity where A and 
B are real but not necessarily commuting quantities. The magnitude of C 
is then 

|U| 2 = A 2 + B 2 + i{AB ~ BA). (7.5) 




278 



The Shaggy Steed of Physics 



Since |C| 2 , A 2 , and B 2 are real in Eq. (7.5), the term i(AB — BA) must 
also be a real function of A and B , say 

i(AB — BA) = f(A, B). (7.6) 

Comparing Eqs. (7.4) and (7.6) it follows that f(A , B) = — i^[A, B\. Since 
[A, B\ and f(A , B) are real, 7 is a pure imaginary constant, say 7 = ih with 
h real, and the equality (7.4) can be written 

(AB — BA) = ih[A, B}. (7.7) 



Note 8 

Canonical Perturbation of Near-Integrable Motion 

The canonical perturbation of near-integrable motion takes as starting 
point an integrable motion in angle-action coordinates (a, J) whose inte- 
grable Hamiltonian is Hq(J) and phase velocity is tv = dH^/dJ. The mo- 
tion is assumed close to a nonintegrable motion whose Hamiltonian H(a, J) 
differs from that of the integrable motion by a small quantity H' (a, J): 

H(a,J) = H 0 {J) + H'(a,J). (8.1) 

Poincare proposed a procedure for obtaining an approximate solution to 
the nonintegrable motion by first neglecting quantities higher order than 
linear in the perturbation H'(a, J) and then making a canonical transfor- 
mation to new coordinates, 

(a, J) -> (a, J), 

such that the Hamiltonian in the new coordinates H(a, J) is a function only 
of the new action: H(a, J) = H(J). The motion would then be integrable 
in the new coordinates. 

Poincare designed the canonical transformation to achieve integrability 
in the new coordinates by making the new Hamiltonian H(J) equal to the 
old Hamiltonian plus the average of the perturbation Hamiltonian over the 
angle coordinates, 

H(J) = H 0 (J) + (H'(a,J)). ( 8 . 2 ) 

The average over all angle coordinates is defined as 

(H\d, J)) = (2 tt)- s / H'(d, J) d s d. 

Jo 

Poincare’s averaging proposal is the key to the procedure and its validity 
rests on the following argument. The phase velocity of the motion in the 
new coordinates is given by 



Co=^H{J) = a J +-t{H'{dJ)). 
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The phase velocities in the old and new coordinates differ by a small quan- 
tity. The perturbation is a “slow” motion superimposed upon the “fast” 
unperturbed motion. It therefore changes little over the time scale in which 
the angle coordinates execute one cycle of their periodic motions. The 
Hamiltonian perturbation is therefore approximately invariant over one cy- 
cle of the angle coordinates; and it is over such a cycle that the average is 
taken. The perturbation procedure fails if the phase velocity of the pertur- 
bation is comparable to the phase velocity of the unperturbed motion. 

The generating function of the transformation from H(a , J) to H(J) is 
the action 

S(a, J) = ajJj + S' (a, J). (8.3) 

It consists of the identity transformation ay .J 3 and a perturbation S' (a, J). 
The problem is to find S' (a, J) which transforms the given H(a , J) to H(J). 

The canonical transformation (8.3) is in the “old position, new momen- 
tum” format described by Eqs. (3.29) and (3.30) according to which the 
remaining coordinates are given by 



a = dS/dJ, 


J = dS/da , 


or 




d 

a = a + —=S'(a, J), 
oJ 


J = J+ (8.4) 

aa 


These equations may be inverted to terms linear (or first order) in the 


perturbations 




d 

a(a, J) « a - —=S’(a, J), 

OJ 


J(a,J)t*J+-?-S'(a,J). (8.5) 

oa 



Note that to this order it is permissible to replace a by a in S' (a, J). The 
Hamiltonian in the new coordinates H (a, J ) is the same as the Hamiltonian 
in the old coordinates, H(a, J ) = H(a, J). The old coordinate Hamiltonian 
on the right-hand side of this equation may be expanded into the new 
coordinates to first order as 

Qrr 

H(a, J) = H 0 (J) + — — J) + H'(a, J), (8.6) 

oJ 

where use has been made of the fact that to first order H'(a, J) « H'(a, J). 
The perturbation H{a , J) may be decomposed into its average value and a 
deviation from the average, AH' (a, J): 

H'(a, J ) = (H\a, J)) + AH' (a, J). (8.7) 

Substituting from the second of Eqs. (8.5) and from Eq. (8.7) into 
Eq. (8.6) and using the fact that u = DH 0 /dJ , one finds 

d 

H(a, J ) = H 0 (J) + u — S' (a, J) + { H'(a , J)) + AH' (a, J). (8.8) 

OCX 
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The Hamiltonian in new coordinates may now be made purely a func- 
tion of J by selecting it from Eq. (8.8) to be H(a,J) = H{J) given by 
Eq. (8.2). This then requires that the action perturbation S' (a, J) is given 
by Eq. (8.8) as 

u-O^S't a, J) = -AH' (a, J). (8.9) 

oa 

Equation (8.9) shows how the action perturbation is related to the 
Hamiltonian perturbation. The action perturbation may be explicitly ex- 
pressed in terms of the Hamiltonian perturbation through their Fourier 
expansions 



S' (a, J) = £) S' m (J)e i{& im,+a2m!+ " 45 '” ,l) (8.10) 

m 

and 

AH' (a, J) = 53 AF;(J)e i(Simi+ “ 2ra2+ - + “» m «). ( 8 . 11 ) 

m 

One may now differentiate Eq. (8.10) and insert it along with Eq. (8.11) 
into Eq. (8.9) to produce the action 



5*(cr, </) — otjJj T 



AJJ 1 |JJg!(aimi+Q 2 m 2 H h a 3 m 3 

m\Uj\ + 17120)2 + • • • + m s u> s 



( 8 . 12 ) 



from which all the properties of the motion may be found. 



Note 9 

Hydrogen Atom Spherical-Polar States 

It is possible to describe the eigenvalues of the hydrogen atom — the en- 
ergy levels and the angular momentum and eccentricity values — with min- 
imal reference to a coordinate system. However, a coordinate system must 
be incorporated into the description to render the eigenstates in detail. 

The complete symmetries of the hydrogen atom are manifested in two 
coordinate systems: spherical-polar and parabolic. Corresponding to these 
coordinates are the two state specifications, the one vector with Casimir 
set (Tt,M,M 3 ), the other spinor with (7i, E 3 , M 3 ). 

The angular momentum is the bearer of the central force symmetry. 
The central force symmetry is manifested in spherical-polar coordinates 
and the angular momentum eigenstates are naturally revealed in these co- 
ordinates. The first task in revealing the spherical-polar eigenstates of the 
hydrogen atom is to establish the angular momentum and energy operators 
in spherical-polar coordinates. The spherical-polar coordinates r, 0, <f> of 
Fig. 4-9 are related to the rectangular Cartesian coordinates by 

a,’i = rsin0cos<^>, X 2 = r sin 9 sin <j>, x 3 = rcosd. (9-1) 
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The operators M 2 and M 3 may be calculated from the definition Mi = 
—iheijkXjd/dxk with Xi and d/dxi expressed in terms of the spherical- 
polar variables using Eq. ( 9 . 1 ). The angle <j> describes rotations about the 
polar axis and the angular momentum represents the rotational aspects of 
space. The polar angular momentum M 3 = therefore has the operator 
representation 



M 3 = —ih 



f d d \ 
~ X2 dx 1 ) 




( 9 . 2 ) 



The operator M 3 represents pure rotations about the polar axis. For the 
total angular momentum operator one obtains in the same manner 



M 2 = -h 2 



1 d 
sin 9 89 




1 d 2 ' 
sin 2 9 d(j) 2 _ 



( 9 . 3 ) 



The total angular momentum consists of a contribution from the angular 
variation in 9 and a contribution from M 2 representing the rotations in <j> 
about the polar axis: 



M 2 = -h 2 



1 d 
sin 9 d9 





Ml 



( 9 . 4 ) 



Equations ( 9 . 2 ) and ( 9 . 4 ) may be compared with their classical coun- 
terparts, Eqs. (4.27) and (4.28): 



M 3 = Pq i, M = Pg+ . 2 M 3 

sin 9 



and the operator p 2 s is found to be 



Pe = 



1 d 
sin 0 d9 



slnS l> 1 ■ 



The total angular momentum operator may therefore be expressed in 
terms of the momentum operator pg and the polar momentum operator 
M 3 in a manner that images the classical connection: 

M 2 =p 2 + ^-M 2 . ( 9 . 5 ) 

sin 9 

The operator M 2 is also directly related to the angular portion of the 
Laplacian operator in spherical-polar coordinates: 

M 2 = -fiVvi. 



( 9 . 6 ) 
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The angular portion of the Laplacian is 



Vi = 



'89 



sin 6 



86 



1 



d 2 



r 2 sin 2 6 8<p 2 



( 9 . 7 ) 



and is itself composed of angular parts in 6 and <f> which match the to- 
tal squared angular momentum as a composition of p 2 s and M 2 given by 
Eq. ( 9 . 5 ). 

Having established the angular momentum operators, let us now turn 
to the energy in spherical-polar coordinates. The kinetic energy operator 
p 2 / 2p is directly related to the Laplacian operator V 2 (Chapter 5): 



p 2 /2p = — (?i 2 /2/i) V 2 . 



( 9 . 8 ) 



In splrerical-polar coordinates, the Laplacian is 



V 2 



^8_ 
r 2 dr 




1 8_ 

r 2 sin 6 89 




1 8 2 
r 2 sin 2 9 dip 2 



( 9 . 9 ) 



In two-body motion the kinetic energy is decomposable into a radial 
portion p 2 / 2p and an angular portion M 2 / 2 pr 2 as described in Chapter 3. 
This decomposition also holds true for the operator representation of these 
variables. The radial energy operator 



Pr/ 2 P 



*Hi, (r?P-\ 

2p r 2 dr \ dr J 



corresponds to the radial portion of the Laplacian while the angular energy 
operator 



M 2 / 2 pr 2 



n 2 \ i d f . d\ 

2/r r 2 sin 9 89 \ 89 J 



1 d 2 ' 
r 2 sin 2 9 dcp 2 _ 



corresponds to the angular portion. The decomposition of the kinetic en- 
ergy operator 

p 2 / 2/i = p 2 J 2/r + M 2 / 2 pr 2 

bears a one-to-one correspondence to the decomposition of the Laplacian 



V 2 



1 ^(r 2 ^) 

r 2 dr \ dr J 



+ V 



2 

_L * 



The total energy or Hamiltonian operator 7 i. = p 2 /2p — k/r is 
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where So = —k 2 p,/ 2fr 2 is the ground state energy and ao = Ti 2 /kp, is the 
Bohr radius. The Hamiltonian operator may also be expressed in terms of 
its component parts as 



n = s o 



(-) 2 t 




1 

h? 




M 2 + 2 — 
r 



(9.11) 



The fact that the Hamiltonian operator splits into a radial and two angu- 
lar portions is the manifestation of the separation of the Hamilton- Jacobi 
equation for the two-body problem on the quantum level. 

The three operators H, AI 2 , and M 3 which describe the spherical-polar 
states of the hydrogen atom are now established. The eigenvalues of these 
operators have been determined in Chapter 5. These eigenvalues are char- 
acterized by three quantum numbers n, l, m: 

H = S 0 /n 2 , M 2 = l{l + 1 )Ti 2 , M 3 = mh. (9.12) 



The eigenstate corresponding to these eigenvalues is symbolized as 'i’nim- 
The operator equations for the eigenstates are therefore 

n im — mhA! nlmj 

M 2 ^ nlm = l(l + l)h 2 ^ nlm , (9.13) 

ft^nlm = (S 0 /n 2 )^ nlm . 

The integrable nature of two-body motion on the classical level [the fact 
that one can separate the Hamilton-Jacobi equation into pure r, 9 , and 
(j> components for which S(r, <j>, 8) = — St + S r (r) + S^cji) + Sg(6) and the 
ordinary differential equations are directly integrable] is also manifested on 
the quantum level. The eigenstates for quantum two-body motion described 
by the Hamiltonian (9.11) are separable. The eigenstate ^(r, 9, (f> can be 
expressed as a product of functions which each separately depend upon r, 
(/>, and 9: 

^ nim^r, (J), 9^) — FL n i (r) 0 / m (^) < h m ( 0 ) , 

The corresponding operator equations (9.13) for M 3 , M 2 , and H also split 
into ordinary differential equations separately governing ( 0 ), Qim(9), 
and R n i(r). For <F m , one finds 

—ih — -rj - = ( 9 - 14 ) 

dq) 

The eigenstates are normalized by the condition J 4 'nirn^ nim < ^ x = which 
therefore requires that the solutions of Eq. (9.14) be normalized by the 
condition 




284 



The Shaggy Steed of Physics 



The normalized solutions of Eq. (9.14) are 

= -4= e< "** (9.15) 

V 27T 

The eigenstates for the motion of the electron about the nucleus must be 
periodic in azimuthal angle: 4> m (</> + 27r) = 4> m (</>). Only those eigenstates 
with integer values of m are therefore required for the hydrogen atom. 

Using Eq. (9.14) to eliminate the (f> dependence in M 2 given by Eq. (9.3), 
the operator identity for M 2 (9.3) becomes an equation for O i m (&)' 



1 d f . Q dQi m 
s\n9dd V md ~de~ 



+ W + 1)0 im — 0 - 



(9.16) 



Although the eigenfunctions which solve (9.14) can be found as elementary 
exponential functions (9.15), the solutions of Eq. (9.16) are not elementary 
functions. This differential equation can be thought of as the defining 
condition for the functions Qi m (9). The functions O i m (6) which are periodic 
in 6 with period 7 r, normalized by the condition 



/* 7T 

/ ©Tm-Oim sin 9 dd = 1, 
Jo 



and satisfying Eq. (9.16) are the normalized associated Legendre polyno- 
mials: 

In the above expression the factor (— l) m is omitted for m < 0. The Leg- 
endre polynomials are defined by the recurrence relations 

1 r ll+m 

P, m ( cos 0) = - J- sin m 6- -J— (cos 2 9 - 1)*. 

1 y ’ 2 H\ dcos6 l + mV ’ 

The first few angular eigenfunctions in 6 are 



©oo — -y=, 



©in = — — cos( 



©1 + 1 = — — sin0, 



©20 = -yp(3cOS -9 - 1), 
n 

H 2 ±i = — sm 9 cos 9, 

n v'lS . 2 
©2±2 = —j— sm 0. 
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Using both Eqs. (9.14) and (9.16) to eliminate the <j> and 9 dependence 
from Eq. (9.11), the operator equation for the energy, the last of Eqs. (9.13) 
with 7 i. given by (9.11) becomes an ordinary differential equation in r: 



jLA 

r 2 dr 




1(1 + 1) 

r 2 



Rnl o 




Rnl = 0. 



(9.17) 



The normalized functions R n i(r ) which satisfy Eq. (9.17) and remain finite 
at r — > 0 and vanish for r — » oo then can be found in terms of a pervasive 
function in mathematical physics: the confluent hypergeometric function 
F“(z). The confluent hypergeometric function is defined by the series 



F*(z) = 1 + 



a z 

7 1 ! 



a(a + 1) z 2 

7(7 + 1) 2! + 



(9.18) 



This series is a terminating polynomial when a is a negative integer which 
is the case for the hydrogen atom. For the integer case with a = —n and 
7 = m, these polynomials may also be defined by the recurrence relation 



F~ n (z) 



(- 1)™- 1 

m(m + !)••• (to + n 



jm+n- 1 

— e ~ z — (e z z n ) 

1 ) dz m+n - l{ ~ ' 



(9.19) 



The normalization condition for the radial dependence is 

R 2 nl r 2 dr = 1. 

With this normalization condition, the radial eigenfunctions are expressible 
in terms of the confluent hypergeometric function as 




Rni(r) = 



2a, 



-3/2 



n l+2 (2l + 1)! 



^ )! 0! (2 r / a o);e - r/ " 0o F 2 7|r 1)+i(2r -M°)- 



(9.20) 

For this particular combination of exponents, the confluent hyper geomet- 
ric function is better known as the Laguerre polynomial and the radial 
eigenfunctions may be expressed alternatively as 



2 a ~ 3/2 

Rni(r) = - 0 



-iZ+2 



(n — l — 1)! 
(n + 1)! 3 



(2r/ao) l e -^ nao L 2l + 1 (2rinao). (9.21) 



The functions L™(z) are the associated Laguerre polynomials defined by 
the recurrence relations 



L'?{z) = {-ir- r ^- T e*z- m d 

( n — m)\ 



(e~ z z n 

dz n ~ m K 



)• 



(9.22) 
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The first few radial eigenfunctions 



are 



1 - - 
2 



r 



e -r/2a 0 



R 10 = 2 op 3 / 2 e- r/ao 

i?2° = ~^ a 0 3/2 

*“ " 57 S“» 3,a (i) 

R 31 = — 3/2 ( — ) [l - l ( — )] e~ r/3a °, 

27^6 ° \a 0 J [ 6 \a 0 J\ 

i? 32 = — 3/2 ( T ' ] e~ r/3a °. 

81^30 0 \aj 

rhe succession of spherical-polar eigenstates for the hydrogen 
ds as 



atom pro- 



'I'ioo : 
d'200 : 



1 n ~ 3 / 2 r - r / a o 
' /= u 0 e > 

1 - 3/2 L 

= 7 = a, 

4 V2n 



2 - 

r 



' r 
V a o , 



e -r/2a 0 , 



^210 = L- a o 3/2 ( — ) e r/ 2 a °cos 6 ». 

4V2tt V °o / 

/oij .1 = —a~ 3/2 ( — ) e ~ r / 2a ° sin 0e ±i<?!> , 



t r 21±l 

V I / 300 : 



: 8^ a ° 



r 



1 



■■a, 



-3/2 



" 3y/3n' 

2 V 2 -3/2 / r 

"27^ a ° Vao, 

2 -3/2 / r N 

' 27^ a ° ' 



2 

3 



1- r 



2 / r 



\ a o J 
1 1 



‘ 27 



6 Vao, 



e ~r/3a 0 ^ 
COS 9, 



1 

2 8176^^ 
1 



-3/2 



. 00 / 

' \ 2 
r 



1 - - 
6 



r 

,ao. 



a 0/ 

e -r/3a 0 ( 

e _ r / 3a ° sin 9e ±i( ^, 



-3/2 / r- \ 2 
= 81^ a ° ' - ’ 



— ) e _r / 3a ° (3 cos 2 

,o 0 ) 

e -r/3a 0 



1 

162-y/Tr' 



-3/2 
a o 



00/ 

r ' 2 



M e -r/3a 0 ^;„2 

,ao / 



0 - 1 ), 
sin 9 cos 9e ±l ^ , 
sin 2 6>e ±i20 . 
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The parity of spherical-polar states may be established in the following 
manner. When the coordinates Xi are inverted as Xi — > —Xi, spherical-polar 
coordinates undergo the transformation 

r — » r, <fi <fi + 7T, 0 — > 7r — 0. 

It can be seen from the forms of 4> m (0) and 0/ m (0) that these functions 
undergo the transformation 

Pr (cos 0) -► (— l) i_I7l p ; m (cos 0), 

g im(j> ^ ^ -^771^27710 

As a result, the state function which is the product of these two 

eigenfunctions undergoes the transformation 

e im<Jpm( cos(9 ) _ (_i)i e im^pm(cos6l) 

or Tnjm — > (— 1 ) l 'Sf n lm- The parity V of spherical-polar eigenstates therefore 
turns on the single quantum number l and is given by 

V = (- 1 )'. 



States of even l have even parity; states of odd l have odd parity. 



Note 10 

Hydrogen Atom Parabolic States 

The coordinates in which the spinor eigenstates are manifested are the 
parabolic coordinates (Fig. 4-10) given by 

xi = pcos<j), X2 = psixi(f>, X3 = (£ - r])/ 2 , (10.1) 

where p 2 = X \ 2 + X2 2 = £ 77 . The magnitude of the position vector in 
parabolic coordinates is 

r = \J Xi 2 + x 2 2 + X3 2 = (£ + rj)/ 2 . 

The inverse relationships between the two sets of coordinates are 

^ = r + x 3 , 77 = ? X3, (p = tan~ 1 (x 2 /x 1 ). (10.2) 

The relationships between parabolic and spherical-polar coordinates are 
also of interest. The polar angle coordinate <f> is common to both systems. 
The parabolic coordinates £ and 77 are related to the spherical-polar coor- 
dinates r and 0 by 



£ = r( 1 + cos 0 ), 
77 = r(l — cos 0 ). 



(10.3) 
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Figure 9-1. Hydrogen Atom Spherical Polar States. States identified by quantum 
numbers ( n,l,m ) are exhibited as probability density surfaces. Multiple surfaces of 
relative maxima in the probability density for the principal levels 200 and 300 are shown 
which locate nodes between them. Atoms in the 211, 311, and, to a lesser degree, the 
322 state have been rotated for better viewing. ( Courtesy George D. Purvis III) 
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Since the polar angle <j> is common to both parabolic and spherical-polar 
coordinates the momentum operator p ^ = M 3 is similarly common to both 
and given by Eq. (9.2): 

M3 = — ihd/dq i>. 

The total energy operator Ti and the eccentricity operator e, however, must 
be given a parabolic coordinate representation. 

The classical Hamilton- Jacobi equation provides a direct transcription 
for the energy operator. The classical Hamiltonian in parabolic coordinates 
given by Eq. (4.38) goes over into the Hamiltonian operator 



* = (h (&l + /M + w/^ ~ rk- 

The operators and p 2 may be developed from the relationships 






2 = -h 2 



d 



i + pdti V ^ 



d 



^2 _ fc 2 1 ® 

Vr >~ h Z + V dr, 



drj 



(10.4) 

( 10 . 1 ): 

(10.5) 



The eccentricity operator S 3 in parabolic coordinates may be calculated 
for S 3 given by Eq. (5.43): 



S 3 = (piM 2 - p 2 M 1 - ihp 3 )/kp - x 3 /r. 



This operator may be expressed in parabolic coordinates in a manner that 
images the classical eccentricity, Eq. (4.43), as 

e 3 = ^ (p 2 v -pQ Am + ^$Am - (10.6) 

By combining Eqs. (10.4) and (10.6) one may separate these equations 
into pure functions of £ and ?y which follow those for the classical separated 
Hamilton- Jacobi equation (4.45): 

S 3 =1 + t (i-L/k - 2 p\/kp - M 3 / 2 k p^ 2 ) , 

V 2 ' (10.7) 

—S 3 =1 + r, (kt/k - 2 p 2 /kp - M 3 /2kpp 2 ^J . 

The operators kt, M 3 , and S 3 all commute. Moreover, the eigenvalues of 
these operators, established in Chapter 5, are 

H=£o/n 2 , M 3 = mTi, e 3 = q/n, 

where n takes all the integers greater than zero and q and m take all the 
positive and negative integers including zero. The wave function has the 
separable form 



^ nqm = X{Z)Y(rj)* m {<t>), 



( 10 . 8 ) 




290 



The Shaggy Steed of Physics 



where the polar eigenstates are given by Eq. (9.18). The sepa- 

rated operator equations for the eccentricity eigenstates which follow from 
Eq. (10.7) are then 



1 d_ (r dX \ _ 1_ ( J_ _ (• n - g) Q o rr^_a^\ 

£ df \ dt; J Qq \ 4 n 2 n 2£ 4 £ 2 ) 



1 d / dY\ 
7 1 dr l \ dt] ) 



^ _ (n + g) a 0 + m 2 o|\ y = Q 

Oq \4n 2 n 277 4 ifj 



(10.9a) 



(10.9b) 



Equations (10.9) are the governing equations for the spinor eigenstates 
in parabolic coordinates. These equations are formally identical save for the 
sign of the quantum number q so the two functions X and Y are the same 
function with values of the quantum number q of opposite sign. Parabolic 
quantum numbers q + , g_ are useful in this representation. They are defined 
in terms of q and in as 



(n — 1 — |m|) + q (n — 1 — |m|) — q 

Q+ = 2 ’ Q ~ - = 2 

and are fixed by the triplet ( n,q,m ). The state specified by ( q + ,q_,m ) 
is equivalent to that specified by (n, g, m) since the eccentrum quantum 
number q is given in terms of (q+ , g_ , to) by 



q = Q+ - q~, 

and the energy level quantum number n by 

n = q+ + q- + \m\ + 1. 

The quantum number q may take positive and negative integers lying be- 
tween — (n— 1) and (n— 1) but the quantum numbers q± are always positive 
and take integer values in the range 



0 < q± < (n - 1). 

One sees that X(^) = X n q_ m (£) and Y{rf) = X nq+m (r]). One may 
therefore express the eigenstate in terms of the single function X nq±m (w) 
as 

4^ nqm — X nq _ m (£,') X nq + m (?^)4 > m (<()) . 

Note that it is q which appears in 4' ngm while q± appears in X nq±m (w) . The 
angular momentum quantum number m appears only as to 2 in Eqs. (10.9) 
so the state functions X nq±m depend only upon the absolute value of to. 
The parabolic eigenstate equations have a structure similar to that of the 
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Laguerre differential equation ( 9 . 17 ) governing the radial eigenstates R n i(r) 
in spherical-polar coordinates. The solutions of these differential equations 
are all expressible in terms of the confluent lrypergeometric function of 

Eq. ( 9 . 18 ). 

The volume in parabolic coordinates is drx = (£ + rf) drj d(j)/4 and the 
parabolic eigenstates are normalized as 

j j Xnq-m(€)X nq+ m(T])X n q_m(€)Xnq +m (T)) ^ dt;dr)=l. 

Both the £ and eigenstates are represented in terms of the same function 
but with different arguments and quantum number indices q±. The 
solutions to Eq. ( 10 . 9 ) for the states X nq± \ m \(w) are 



v < n_ 2 1/4 a 0 3/4 l(q± + |rn|)! , \\m \/2 

X nq± \m\( w ) — n | m || y {w/na o) 

x e -w/2na OF - m ^ {w/nao y 
The first few eigenstates X nq± \ m \ (w) are 



( 10 . 10 ) 



*iooM =2 1 /% 3/4 e~ w ' 2a °, 
2 1 /4 

X200{w) : 



^_~ 3 / 4 p -w/4ao 
2 tt 0 e ’ 






z, l ■=- V 

y / X 2 1/4 -3/4(t") 1/2 

Xm(w) = U) 

XsooM 

y. / X 2 1 / 4 —3/4 

-*310 M = - Uq 

Y / X 2 1 / 4 -3/4 

X 32 o ( w ) ~—^~ a 0 



\ A / 

e - W /4a 0) 

ao/ 



l-i(” 

3 \ao 

x _ 2 / w 
3 



ao/ 18 \«o. 



ii;/6ao 



A 320(w) a 0 1 _ 3 

Y / x 2 1 / 4 _ 3/ 4 / ffi\ 1/2 

X 30I W=^a„ (~) 

2 1 / 4 V / 2 _ 3/ 4 f w \ 1/2 

Xsn( "' ) -^cr a » UJ „ 

Y ■ 2 1/4 V / 2 3/4 / 10 \ w /6an 

X 3 02{w) - - a 0 



e ~w/6a 0 

1 






c —w 



)/6a 0 



l - t f — 

6 \ a o. 






)/6a 0 
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The first few parabolic eigenstates 4 r ngm (£, ? 7 , 0) [with the complete state 
function 4' ngTra (£, 77 , (f>) expressed in terms of the quantum number q rather 
than the quantum numbers g±] are 



4' 1 oo =^a” 3 / 2 e- ( f +?? )/ 2ao , 



/7T 






20±1 — 



4^210 — 






2-10 — 



1 / f \ V 2 / \ V 2 

J _ a ~ 3/2 ( — j ( — ) e ~ (J+r ' )/ 4 ao e ±i,? \ 

8 a / 7 t 0 \a Q J \a 0 J 
-^a" 3/2 fl- 1 ^ 

1 

vr 



-3/2 



iTr 1 —3/2 

4' 300 = — 






31±1 — 



'9v^F 
\/2 _ 3 /2 / £ 
270t a ° Uo 



2 \ao 

l-ifU 

2 \ a o 

3 \ a 0 



p -(C+'/)/4a 0 



p -(C+'/)/4a 0 



l--\ 


( rj_Y 


l 3 1 


ya 0 )_ 



r® 



_ \/2 „-3/2 / £ 



* 3 " 1:tl ^2Vft a ° 



\ 1/2 



ao/ 



1/2 



1/2 



1 - 



1 - 



g-(£+»))/6ao 

1/f 



6 \ao 
1 / 77 



T 3 o ± 2 =— ^«o _3/2 (-) (-} e-(^ )/6ao e ±i2 ^ 
162 Vtt \ a o/ \ a o 



,Tr —3/2 

4' 320 =- ~ «r 



"9^ 



£ 



41 3—20 — 



-3/2 



9^ 



3 V a o 

2 fri_ 

3 \a 0 



1 n 



1 ^ ^ 



1 - 



18 \ao 

{ Jl 

18 V a 0 



6 \a 0 
e 
2 



e -(5+'7)/6oo e ±i</> 



o -(i+v)/&a 0 e ±i<j> 



p -(C+^)/6a 0 



p-(C+»7)/6ao 



The common factor (£ + 77 ) /2nao appears in all the parabolic eigenstates; 
but in parabolic coordinates the factor (£ + 77) /2 is the magnitude of the 
position vector r as indicated by Eq. ( 10 . 1 ). The exponential decay terms 
are therefore of the form 

p -(i+v)/2na 0 __ —r/nao 



and are identical to the exponential decay factors of the state functions 
'i’nim in spherical-polar coordinates. Since these factors depend only upon 
the energy level quantum number n and this quantum number is common 
to both state specifications, the two factors are identical. Moreover, since 
the ground state is completely symmetrical in both state specifications, 
the ground state functions T 100 are identical for both spherical-polar and 
parabolic coordinates. 

There are no parabolic states of the form 4'„oo when n is even. It is not 
possible for the angular momentum and eccentrum to vanish simultaneously 
in such states. 
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Figure 10-1. Hydrogen Atom Parabolic States. Parabolic states are identified 
by the quantum numbers (n,q,m). The parabolic states 100, 201, and 302 are identical 
to the ( n,l,m ) spherical-polar states 100, 211, and 322 of Fig. 9-1 but are shown here 
from a different angle. The polar axis is the symmetry axis of the structures. The 
atoms have been rotated for the best viewing angle. (Courtesy George D. Purvis III) 
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Consider now the parity of the parabolic states. The polar eigenfunc- 
tion is common to both the splrerical-polar and parabolic states. Upon 
coordinate inversion, the polar angular eigenfunction undergoes 

^ ^ i 

The parity of the parabolic eigenstates may be established by noting that 
on coordinate inversion x t — > — Xi, the radial coordinate r is unchanged and 
the parabolic coordinates undergo 

£ = r - x 3 -> r + x 3 = rj, 
r/ = r + x 3 — y r — x 3 = x, 

<f> — > <j> + 7T. 

Coordinate inversion therefore interchanges f and ?y. 

The eccentricity is a function of odd parity (it changes sign when the 
coordinates are inverted). When its quantum number q is inverted, the 
quantum numbers undergo the change 

q -> - Q , 
q+ ->■ q~, 
q- q+- 

Since \m\ does not change sign, inversion of the quantum number q results 
in an interchange of the parabolic eigenfunctions 

X n q_ \m\ (0 * -X-nq+ \m\ (v), 

Xnq+ \m\ (v) * X n q_ \m\ ( 0 - 

Thus, when coordinate inversion switches the arguments of X nq _\ m i(£) and 
X nq ,\m\{ , n)i inversion of the quantum number q returns these functions 
and arguments to their original state. The eigenstates 4 > nqrn involve the 
product of the two parabolic eigenfunctions X nq _\ m \(f i ) and X nq+ \ m \(if) 
and the polar eigenfunction. They are therefore transformed under parity 
as 

Vi/ —j. 

The parity V of parabolic eigenstates therefore turns on the quantum num- 
bers to and q. 
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